A BOUNDARY VALUE PROBLEM FOR A SYSTEM 
OF ORDINARY LINEAR DIFFERENTIAL 
EQUATIONS OF THE FIRST ORDER* 


BY 
GILBERT AMES BLISS 


The boundary value problem to be considered in this paper is that of 
finding solutions of the system of differential equations and boundary con- 
ditions 

dy; 
— = + ABia(2) 


dx a=1 


(1) 

[Miaye(a) + Nixya(6)] = 0 (i=1,2,--- m). 
Such systems have been studied by a number of writers whose papers are 
cited in a list at the close of this memoir. The further details of references 
incompletely given in the footnotes or text of the following pages will be 
found there in full. 

In 1909 Bounitzky defined for the first time a boundary value problem 
adjoint to the one described above, and discussed its relationships with the 
original problem. He constructed the Green’s matrices for the two problems, 
and secured expansion theorems by considering the system of linear integral 
equations, each in one unknown function, whose kernels are the elements 
of the Green’s matrix. In 1918 Hildebrandt, following the methods of E. H. 
Moore’s general analysis, formulated a very general boundary value problem 
containing the one above as a special case, and established a number of 
fundamental theorems. In 1921 W. A. Hurwitz studied the more special 
system 
du 
dx 
aou(0) + Bov(0) = 0, a,u(1) + Biv(1) = 0 


and its expansion theorems, by the method of asymptotic expansions, and 
in 1922 Camp extended his results to a case where the boundary conditions 
have a less special form. Meanwhile Schur in 1921 published very general 
expansion theorems for the original system (1) under the hypothesis that 
the matrix of functions B,(x) has all elements identically zero except those 


dv 
= [a(x) + r]o(x), + 


(2) 


* Presented to the Society, December 30, 1924; received by the editors in December, 1925. 
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in the principal diagonal, which are real, distinct, and positive for every 
value of x. The cases of Hurwitz and Camp are not included under 
this one even after a linear transformation. Carmichael exhibited in 1921- 
22 the analogies between certain algebraic problems and boundary value 
problems of many types, including those of the type (1), but without giving 
the details of the theory of the boundary value problems themselves. In 
1923 Birkhoff and Langer showed that the large class of systems (1) for 
which the determinant of the functions By(x) is different from zero can 
be reduced, by a linear transformation of the functions »,(x) whose coeffi- 
cients may be imaginary, to the simpler form for which all the functions 
B(x) with i#k vanish identically. They obtained expansion theorems 
under the further hypotheses that the functions B,;(x) are all distinct from 
each other and from zero, though not necessarily real and positive as in 
Schur’s paper, and that they satisfy certain other more artificial restric- 
tions.* The problems considered by Hurwitz, Camp, and Schur are among 
those included in their theory. An abstract of the paper of *Birkhoff and 
Langer was printed in 1922. 

The methods of Hurwitz, Camp, Schur, and Birkhoff and Langer are 
those of asymptotic expansions which for the more general cases become very 
complicated. In the following pages it will be shown that a large class of 
so called self-adjoint boundary value problems, analogous to equations 
with symmetric kernels in linear integral equation theory, can be treated 
by much more elementary methods. 

A boundary value problem adjoint to the problem (1) has the form 
= > (Aas + ABai)Za; 


(3) dx 


[ PaiZa(a) + QuiZa(b) =0 (i 1,2, n), 


where the coefficients Px, Ox satisfy the relations 
MiaPar — = 0 (i,k = 1,2, een » 


The original problem is defined in this paper to be self-adjoint if the systems 
(1) and (3) are equivalent under a transformation of the form 


n 
Tia(*) Yay 
a=l 
* See pp. 83, 89, 109. One should note also the hypotheses on the roots of D(A) on pp. 98 
and 105. 
t See also an earlier abstract by Birkhoff, Bulletin of the American Mathematical So- 
ciety, vol. 25 (1919), p. 442. 
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and it is definitely self-adjoint when a further hypothesis, explained in 
Section 2, is added. For a definitely self-adjoint boundary value problem the 
characteristic constants and functions are not only denumerably infinite in 
number, but the constants are real and each has its index equal to its multi- 
plicity as a root of the characteristic dete:minant. The characteristic func- 
tions may be chosen real. Furthermore expansion theorems of a very general 
sort may be justified by relatively simple methods analogous to those of 
integral equation theory. 

The problems of Hurwitz and Camp described above are definitely self- 
adjoint according to the definition of this paper. Those of Schur are never 
self-adjoint, and a sub-class only of the problems considered by Birkhoff 
and Langer have this property. The problems of Birkhoff and Langer are 
furthermore never definitely self-adjoint when the coefficients in the differ- 
ential equations are real. On the other hand, the theory of definitely self- 
adjoint boundary value problems as presented here includes a large category 
of problems for which the determinant |B a(x)| vanishes and which do not 
fall under any of the cases treated by the authors just referred to. In this 
category are problems of the type (1) which arise in connection with the 
calculus of variations, all of which have the property of self-adjointness 
described above. The importance of this class is evident when one considers 
the fact that the boundary problems of mathematical physics almost in- 
variably belong to it. Another case is the system of the type (1) equivalent 
to the well known boundary value problem for a single linear differential 
equation of the mth order. This system does not fall under the theories pre- 
sented in the papers mentioned above, and only self-adjoint equations of 
this sort give rise to self-adjoint problems of the type considered in the follow- 
ing pages. The interrelationships just mentioned between problems of 
various types are not elucidated in detail in this paper. I expect to return to 
them on another occasion. 

The properties of the system (1) which justify expansion theorems 
analogous to those well known for other problems have not so far been clearly 
classified and analyzed. They seem to depend primarily upon the character 
of the matrix of functions By(x). A further study of this question, and a 
correlation of the methods and results hitherto attained, is desirable. 

The methods used in the following pages are developed independently 
of the theory of linear integral equations though there are many points of 
contact. In Section 4 below it is shown that the equations (1) are equivalent 
to a certain system of linear integral equations for the functions y;,(x), a 
result which is well known. The kernel matrix of these integral equations is 
not in general symmetric. When (1) is self-adjoint, however, every solution 
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yi(x) of the integral equations goes into a solution of the “associated” 
system of integral equations by means of a transformation of the form 


a=l 
for which the matrix ||S,|| is symmetric. If the determinant |B | is different 
from zero the same is true of |S ,,|,and the system of integral equations equiva- 
lent to (1) can be reduced to one with a symmetric kernel matrix by means 
of a suitable transformation. The theory of the boundary value problem 
is then a corollary to the theory of a system of integral equations with 
symmetric kernel matrix such as has been developed, for example, by E. H. 
Moore in his “general analysis.” The case when the determinant |Bx| 
vanishes includes some of the most important boundary value problems, as 
has been explained in the preceding paragraphs. For some of these it is 
possible to reduce the number of functions y,(x) by transformation and 
thereby to change the problem into an equivalent one whose corresponding 
integral equations have a symmetric kernel matrix, but it does not seem 
possible always to make such a transformation. 

1. Adjoint systems. In the following pages the notations of tensor 
analysis will be used. It is understood that the indices 7,7, k, ---, a, B, 
y, have the range 1, 2, - - - , and that an expression such as A 
stands for the sum 


AiaBra = + + A inB in 


taken with respect to the repeated index a. The index a may be called an 
umbral index.* Such indices will usually be denoted by Greek letters. 
As a matter of formal algebra, consider the four linear expressions 


si(y) = M + t;(z) = P aiZa(a) + Qaiza(b), 


4 
(4) = Mia¥a(a) + Niaya(d), = Paiza(a) + Qaiza(d), 

formed for two sets of functions y,;(x), z;(x). The coefficients in these ex- 
pressions are real constants and the symbols on the left are merely notations 
for the sums on the right. If the coefficients in the sums s,(y) are given, with 
matrix of rank n, then a matrix of coefficients for ¢;(z) can always be selected 
also of rank » and so that the relations 


(5) M iaP ar =0 


* Murnaghan, Vector Analysis and the Theory of Relativity, p. 7. 
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are satisfied. The coefficients of the auxiliary sums 5,(y) and /,(z) can then 
be chosen so that the two matrices 

Mi Nix — Pu 
Nix | Orn 
with 7 constant in the rows and & in the columns, are reciprocals. From the 


relationships between the coefficients of these matrices found by multiplying 
them in reverse order it follows readily that the equation 


(6) 


’ 


(7) + 5a(y)ta(2) = 


is an identity for all systems y,(x), 2;(x). This is an equation which will be 
frequently useful. When the coefficients of the sums s,(y) are given the 
coefficients of the #;(z), chosen as above indicated, form 2n linearly indepen- 
dent solutions u;, v; of the equations 


Miata = 0. 


All other solutions of these equations are expressible linearly in terms of 
these , and it follows readily that two different choices of the coefficients 
in ¢;(z) give systems of equations /;(z) =0 which are equivalent. 

The systems of differential equations and boundary conditions con- 
sidered in this paper may be written in the form 


(8) yi (x) = Asa(x)ya(x), Si(¥) = Miaya(a) + Niaya(d) = 0. 

The coefficients A x(x) are supposed to be real single-valued continuous 
functions on the interval a<x<b; the matrix ||Mix, Nx|| of coefficients in 
s;(y) is of rank n; and solutions y,(x) of the equations, continuous with their 


derivatives on the interval ab, are sought. A system adjoint to (8) is by 
definition one of the form 


(9) 2i(x) = — Aas(x)za(x), = Pasta(a) + Qaita(b) = 0 


where ¢;(z) has been formed from s,(y) in the manner described in the pre- 
ceding paragraph.* 

Let ||Yu(x)|| be a matrix of functions whose columns form 7 linearly 
independent solutions of the differential equations in the system (8). The 
most general solution of these equations has the form y,(x) = Y jaca where the 
coefficients c; are constants. The solution Y,;,---, Yn; will be denoted 
simply by Y; when no confusion results, and the determinant whose elements 


* This definition is that of Bounitzky, p. 73, and is analogous to that of Birkhoff for linear 
equations of the mth order, these Transactions, vol. 9 (1908), p. 375. 
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are Dy. =s;(Y;) will be denoted by D. For the adjoint system (9) the corre- 
sponding notations will be Z;,, E. The first two theorems to be proved below 
are analogous to similar theorems for linear integral equations and for other 
types of boundary value problems, and are already in the literature.* 


THEOREM 1. If the determinant D is different from zero the only solution of 
the boundary value problem (8) is y;(x)=0. If D has rank n—r then the problem 
has r and only r linearly independent non-vanishing solutions. 


This is easy to see, since every solution of the boundary value problem 
must have the form y;=V sca with coefficients c; satisfying the linear 
equations D jaCa =5i(Va)Ca=0. 


THEOREM 2. The number of linearly independent non-vanishing solutions 
of the original system (8) 1s always the same as the number for the adjoint system 
(9). 

To prove this one may note first that for every pair y,(x), 2;(x) of solu- 
tions of the differential equations in (8) and (9) the equations 


+2398 = ApaVatp — Aap¥pta = 0, 
(10) Ya(x)Za(x) 0 


hold true. Suppose then that Ui,,---, Unp (p=1,2,---,7) arer linearly 
independent solutions of the boundary value problem (8). The r sets $;(U;) 
(p=1, 2,---, 7) are also linearly independent, since otherwise there would 
be a set of functions y;=U,,c,, with constants c, not all zero, making the 
expressions 5;(y), as well as the sums s;(y), all zero. Since the determinant 
of the first matrix (6) is not zero the equations (4) would then imply y,(a¢) = 
ys(b) =0, and the solution y,(x) would have to vanish identically, which is 
impossible when the solutions U, are linearly independent. From the rela- 
tions (7) and (10) it follows now that 


Sa(U5)ta(Zx) = 0 (k=1,2,---,”; 
Hence the determinant E whose elements are Ey =?;(Z,) has at most rank 
n—r. By similar reasoning it follows that when E has rank n—r the rank 


of D is at most m—r,so that D and E have the same rank, and the number 
of linearly independent solutions of the systems (8) and (9) is the same. 


THEOREM 3. If the determinant D is different from zero the non-homogeneous 
system 


(11) (x) = Aia(x)ya(x) + g(x),  sily) = hi, 
* Bounitzky, p. 77; Birkhoff-Langer, p. 64-5; Hurwitz, p. 526; Camp, p. 30. 
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where the functions g;(x) are continuous on the interval ab, has one and only 
one solution. 


The differential equations of the system (11) have as their general solution 
(12) yilx) = Vio(x) + Via(x)ca 

where Yo is a particular solution and the sets Y, are as before linearly inde- 
pendent solutions of the equations with g,(x)=0. Since D=|s,(Y;)| is 
different from zero the constants c; can be chosen in one and but one way so 
that 


= s(Vo) + si(Va)ea = hy. 


THEOREM 4. If the determinant D has rank n—r then the system (11) has 
solutions if and only if the equation 


(13) f 


is satisfied for every solution z;(x) of the adjoint system (9). The most general 
solution of (11) is then 


where yi*(x) is a particular solution and the sets Uip (p=1,-++, 97) are r 
linearly independent solutions of the original system (8). 


If y;(x) and 2,(x) are solutions of the differential equations in the systems 
(11) and (9), then one readily verifies that 


(14) Yala t Vata = Lara, 
and hence, with the help of equations (7), that 


b 
(15) ff = = + 


If y and z satisfy the boundary conditions in (11) and (9) the value of this 
expression is zero. 

Suppose, conversely, that equation (13) holds true for every solution 
2;(x) of the adjoint system (9). Every solution of the original system (11) 
must have the form (12) with constants c; satisfying the equations 


si(y) = si(Vo) + cas(Va) = 0. 


By the argument of the proof of Theorem 2 it follows that the sets /,(z), 
formed for r linearly independent solutions z,;(x) of the adjoint system (9), 
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are themselves r linearly independent solutions of the linear equations whose 
coefficients are the columns of the determinant D=|s,;(Y,)|. Hence the 
last equations have solutions c; if and only if the conditions /,(z) sa(Yo) =0 
are satisfied for every solution z;(x) of the adjoint system (9). But from 
equation (15) with y replaced by Yo, and from the boundary conditions 
t,(z) =0, it follows then that solutions c; will certainly exist since the con- 
ditions 


0= f = Sa(Vo)ta(z) 


are satisfied. 

The last statement of the theorem is true since if y,(x) and y,*(x) are 
both solutions of the non-homogeneous system (11), their difference y,(x) 
—y.*(x) satisfies the original homogeneous system (8). 

2. Self-adjoint systems. Consider now a system of the form 


(16) yt (Aia + ABia) Va; si(y) = M iaya(@) + Niava(b) =0 


containing a parameter J linearly and having its coefficients A x(x), Bix(x) 
all continuous on the interval a<x<b. The functions B y.(x) are by hypothe- 
sis not all identically zero. An adjoint system has the form 


(17) (Aas + ABai)Za; t;(z) P aiZa(a) + QaiZa(b) = 0, 


where s;(y) and #,(z) are related as in the preceding section. 

The existence theorems for differential equations tell us that there exists 
a matrix ||Y (x, )|| whose columns are m linearly independent solutions 
of the differential equations in the system (16) and whose elements with their 
derivatives with respect to x are expressible as permanently convergent 
power series in X. The determinant D(A) formed for this system is therefore 
also representable by a permanently convergent power series. The roots of 
D(a) are called the characteristic values of the parameter \, and the non- 
vanishing solutions y,(x) of the system (16) corresponding to such values 
are called characteristic solutions. The corresponding notations for the ad- 
joint system (17) are Z,(x, \) and E(A). It is well known that the roots of a 
permanently convergent power series D(A) are finite or at most denumerably 
infinite in number. The following theorem is an immediate consequence of 
Theorems 1 and 2 of the preceding section. 


THEOREM 5. The characteristic parameter values for the boundary value 
problem (16) are identical with those for the adjoint problem (17). The number 
of linearly independent characteristic solutions corresponding to a particular 
characteristic value d is the same for both problems. 
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THEOREM 6. Jf are characteristic values and y;(x), corre- 
sponding characteristic solutions of (16) and (17), then 


b 
(18) f Bap(x)ya(x)ze(x)dx = 0. 


For from the differential equations in (16) and (17) with A, and dg, sub- 
stituted it follows that 


Yo Za + Vata = sta 


and hence, from the boundary conditions s;(y) =¢;(z)=0 and equation (7), 
that 


b 
b 


In the following pages a set of relations of the form 2;=T a(x) ¥q will be 
called a transformation if the functions T(x) are real, single-valued, and 
have continuous derivatives on the interval ab, and if the determinant 
\T (x)| is different from zero on that interval. The coefficients of the in- 
verse transformation will be denoted by T%}(x). 

Definition of a self-adjoint system. The boundary value problem (16) 
is said to be self-adjoint if the differential equations and also the boundary 
conditions of its adjoint (17) are equivalent to its own for all values of A 
by means of a transformation 2;=T a(x) Va. 


THEOREM 7. In order that the problem (16) shall be self-adjoint it is neces- 
sary and sufficient that there shall exist a transformation T x(x) such that 


(19) TiaA ak + Aail ak + Ta= 0, TiaBak + BaTar= 0, 
(20) MiaT = NiaT (b)N 


To prove the first two of these relations one can verify readily that the 
transformation 2;=T ;«y, takes the differential equations of the system (16) 
into the set 


-1 -1 | 
zi = [TiaA apT py + AT iaBasT py + TisT oy 


If these are equivalent to the differential equations in the adjoint system 
(17) for all values of A, the equations (19) follow at once. 

* By the transformation z;=T jay. the boundary conditions of the adjoint 
problem take the form 


PaiT + QaiT ap(b)ye(b) = 0. 


4 

0= |, = Bgavatpdx. 
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For these to be equivalent to the original boundary conditions s;(y)=0 
it is necessary and sufficient that a matrix of constants Cy, with determinant 
different from zero exists such that 


(21) PaiTar(@) =CiaMark,  QaiTar(d) = CiaNar. 
With the help of the relations (5) it follows that the preceding equations 
imply 
-1 -1 
iy iy] = 0. 
Since |C x | 0 the equations (20) of the theorem follow at once. Conversely, 
since the matrix ||M., N «|| is of rank , the equations (5) and (20) imply 


relations of the form (21), so that the boundary conditions of the two systems 
are surely equivalent. 


THEOREM 8. For a self-adjoint system (16) two characteristic solutions 
ya(x), corresponding to distinct characteristic valus satisfy the equation 


b 
(22) f Sas) = 0 


where Sux =Tai 


This is an immediate consequence of the equation (18) and the trans- 
formation 2;=T iaVe- 

For the following definition it is important to note that when the matrix 
\|Sia|| is symmetric the bilinear form Sasf.fa, formed for a set of numbers f; 
and their conjugate imaginaries / ;, is always real, since such a form is identi- 
cal with its conjugate. The functions g,(x) in the definition are supposed to 
be continuous on the interval ab. 

Definition of a definitely self-adjoint boundary value problem. A problem 
(16) is said to be definitely self-adjoint if the matrix ||Sx(x)|| is symmetric 
and the bilinear form Sas(x) fafs is positive or zero at every point of the inter- 
val ab, and if furthermore this form vanishes identically for a set of solutions 
fi(x) of a system of equations of the type 


(23) fi (x) Aja(x)fa(x) + Bia(x)ga(x) 


only when the functions f;(x) are all identically zero. 

The conditions of the definition will surely be satisfied if the quadratic 
form S.s(x) fafs is positive definite at every point of the interval ab, \since the 
bilinear form Sas fafs is then always positive for non-vanishing arguments 
fi, as one readily verifies. In that case the determinant |Su|=|Ta| |Bal is 


¥ 


1926} A BOUNDARY VALUE PROBLEM 571 


everywhere different from zero. The same is therefore true of |B,|, and the 
second equation (19) shows that 


= Six — See = TaiBar — Tar Bai = (Tas + Tia) Bar. 


It follows readily that the matrix ||7'|| is skew-symmetric. Since a skew- 
symmetric determinant of odd order always vanishes, and since the deter- 
minant |7 | must be different from zero, it is clear that this case can arise 
only when m is even. It should be noted, however, that the definition is 
applicable to cases when the determinant |B,,| vanishes and the quadratic 
form Sagfafs is not definite. Important special cases of this sort are the 
boundary value problems arising from the calculus of variations and the 
problem which arises when a boundary value problem for a self-adjoint 
linear differential equation of the mth order, of a type hitherto often studied,* 
is transformed into one of the type (16). 

If the bilinear form S.f.fz is non-positive it can always be replaced by 
one which is non-negative by using the transformation with coefficients 
—T« instead of The requirement of symmetry for the matrix ||S «|| 
is also not as stringent as it perhaps appears to be at first sight. If the equa- 
tions (19) have a system of solutions T,, then the systems T x=Ti; and 
T x—T are also solutions. The matrix of elements 
is skew-symmetric, and one readily verifies by means of the relations (19) 
that for a skew-symmetric system Ty, the elements Sy have the symmetry 


required. If the matrix of elements T;;—T 4 is to be useful for a transforma- 
tion, however, the determinant |7.;—7%| must be different from zero. 

When the problem (16) is definitely self-adjoint the elements Sx are 
expressible in the various forms 


(24) Sit = TasBar = TarBai = — TiaBar 


as one readily verifies from the symmetry of S, and the relations (19). 

3. Properties of self-adjoint systems. Boundary value problems of the 
type (16) which are definitely self-adjoint have many properties analogous to 
those of linear integral equations with symmetric kernel functions, as indi- 
cated in the following theorems. 


THEOREM 9. For a definitely self-adjoint boundary value problem (16) 
all roots of the characteristic determinant D(d) are real and the linearly inde- 
pendent characteristic solutions corresponding to each root may be chosen real. 


* See, for example, Darboux, Théorie des Surfaces, vol. 2, p. 109; Birkhoff, loc. cit., p. 373; 
Bounitzky, p. 88. 
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For suppose \ a root of D(A) and y,(x) a non-vanishing solution of the 
system (16) corresponding to it, and let \ and 9,(x) be their conjugate imagin- 
aries. If X were not real, equation (22) of Theorem 8 would require the bi- 
linear form S.gya¥3 to vanish identically in x, which is impossible when the 
solution y,(x) is not identically zero. Hence the root \ is real. But if d is 
real then the real and imaginary parts of y;(x) are separately solutions of the 
system (16), and it is evident that a linearly independent set of real char- 
acteristic solutions corresponding to \ can be selected. 


THEOREM 10. The index of each characteristic number Xo, 1.e., the number 
of linearly irdependent characteristic solutions y;(x) corresponding to it, is 
equal to the multiplicity of > as a root of D(A). 


Suppose that D(A) = ls{ has rank m—r at a particular value Xo. 
By replacing the solutions Y;(x, \) by suitably selected linear combinations 
of them with constant coefficients it may be brought about that for \=Xo 
the expressions s;(Y,) (p=1, - - - , 7) all vanish, while the matrix of elements 
(g=rt+1,---,m) hasrank n—r. All derivatives of D(A) of order less 
than r will then clearly vanish at \=o, and the rth will have the value 


where the subscript A indicates derivatives. If this expression vanished there 
would be a linear combination 


ye = + Vien) + V in) 


for which all the numbers s;(y) would vanish at A=Xo. The constants 
¢1, °° +, ¢, could not all be zero because the rank of the last »—r columns of 
D® (Xo) is n—r. The functions y;; whose derivatives for \ are in the first 
parenthesis would therefore not vanish identically. For \=\) they would 
satisfy the system (16) and also the equations 


= (Aiea + ABia) Yarn + Biaya- 


The set yi2 defined by the second parenthesis would satisfy the differential 
equations of the system (16), and it follows readily that the functions y,; 
themselves would for A=» be solutions of the non-homogeneous system 


yi = (Aja + AoBia) Ya + Biayai, si(y) = 0. 


The functions 2i::=Tia¥a1 would satisfy the adjoint equations (17), and 
from equations (13) of Theorem 4 it would follow that 


ff -f YaSasypidx = 0. 
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This could not be true, however, since the functions y,; would not all vanish, 
as was seen above. It follows therefore that the derivative (25) is different 
from zero and that A» has its multiplicity equal to its index. 


THEOREM 11. For a system of functions f (x) continuous on the interval ab 
and satisfying the condition 


(26) f = 0 


with every characteristic solution y;(x) of the boundary value problem (16), 
the functions B ia(x)fa(x) all vanish identically.* 


To prove this let f;(x) be a set with the properties described in the 
theorem. According to Theorems 3 and 4 the non-homogeneous system 


(27) yi (Aia + ABia) Va + Biafa; si(y) = 0 


then has solutions for every value of \, since the equations (26) imply the 
conditions analogous for this case to equations (13) of Theorem 4. When 
D(A) #0 there is a unique solution and one verifies readily that it consists of 
the functions 


Vio Vin 

Sa( Yo) Sn( V1) Yn) 


where VY io(x, A) is a particular solution of the differential equations in 
the system (27). 

Near a root Xo of multiplicity r of D(A) the functions y;(x, A) are still well- 
defined and analytic in X. For, in the first place, one can add constant 
multiples of the last x columns of the determinant to the first column in such 
a way that the resulting functions Yj satisfy the conditions s;(Yo)=0 at 
A=Xo. In the second place the r linearly independent solutions 2;(x) of the 
adjoint system (17) for A=» provide r linearly independent sets #.(z), as in 
the proof of Theorem 2, such that 


Sa(Vi)ia(z) = --- = = 0. 


One can therefore replace r of the rows of the determinant in the expres- 
sions (28) by r linear combinations of its m rows which vanish at A\=Xo. 


* The idea underlying the proof of this theorem is very well known. It has been used, for ex- 
ample, by E. Schmidt, p. 18; Hurwitz, p. 539; and Camp, p. 37. 
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It is clear that the determinant in (28) has the same factor (A—Xo)’ as 
D(d), and hence that the functions y,(x, \) are analytic near X» as well as 
near the values of A at which D(A) ~0. 

The functions y;(x, \) are representable by permanently convergent 
power series in A of the form 
(29) A) = + + + 
By substituting these series in the equations (27) and comparing coefficients 
of \ it is found that the coefficients uj, (u=0, 1, 2, - - -) satisfy the equations 
(30) = Aigtay + = 0 
in which it is agreed that u;,_1=f;. With the help of equations (30) and (19) 
one verifies also that the functions vi, =7 jaMa» are solutions of the system 

Die A citar t;(v,) = 0, 

where -1=T iafa, and it follows that 


Uaular + Uaplar = 6 aplar gag 


This result and the equations (7) and (24) now justify the relation 


b b 
0= = f apt f apts »—1dx. 
a a 


a 


The analogue of a well known inequality of Schwarz 


2 
a a : a 


which will be proved in the next section, together with the last equation, 
show that the constants 


b 
W,= f UadS 
have the properties 


b 
(31) Wem = W,- 


The series 
(32) WotWrt-:::, Wot Wadr?+---, 
the first of which is found by integrating the expression aoS ag found from 


(29), converge for every value of X. Some coefficient in the second series 
must vanish. Otherwise it would follow from the inequality (31) that 
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W>2,=>W.(W:/Wo)*, and the second series (32) would not converge for \= 
(W2/W,)*. But if a particular coefficient W», vanishes the inequalities (31) 
imply that all the preceding ones of even index are also zero. From the equa- 
tion W,=0 it follows, however, that the functions #, all vanish identically, 
and the first of the equations (30) shows that the functions u;_1=Biafe 
do the same. 


Corottary 1. If the determinant |Bix(x)| is different from zero on the 
interval ab, as in the case when the matrix ||S x|| =||Tai Bax|| is positive definite 
at each point of ab, then the set of functions f (x) =0 is the only one satisfying the 
relation (26) with all characteristic solutions y;(x) of the boundary value problem 
(16). 

Corotiary 2. The only set of solutions f (x) of a system of equations of the 
form 


(33) fi Aiafe + 
satisfying the relations (26) with all characteristic solutions y,(x) is the set 
f(x) =0. 

With the help of the last corollary it is possible to prove 


THEOREM 12. The totality of characteristic solutions and characteristic 
constants of the boundary value problem (16) are denumerably infinite in number 
and may be represented by the symbols yi (x), »(v=1,2, - - -). These functions 
may furthermore be chosen normed and orthogonal in the sense that 


b 
(34) f apy pdx = by (Sun = 1, dy = Oifu ¥ »). 


Let the functions g, in equations (33) have the form g;=d, gi, (o=1, - - -, 
p). The functions g,, can be chosen so that the p systems Bia Zap (9p=1,---, 
p) are linearly independent, since the functions B, do not all vanish identi- 
cally. The system (33) has a solution of the form f;=d,f;, where the system 
fi, for p fixed is a particular solution of the equations (33) with g,; replaced 
by gi. The solutions f;, are readily seen to be linearly independent since the 
sets Bia Sap(p=1, ---, p) have this property. If there were a finite number 
only of characteristic solutions of the boundary value problem (16) then for 
a sufficiently large value of p, constants d, not all zero could always be 
selected so that the functions f;=d, f;, would satisfy the relations (26) with 
every such solution, and this would contradict Corollary 2 above. Hence 
there must be an infinity of characteristic solutions, and the infinity is 
denumerable since the roots of the permanently convergent series D(A) are 
denumerable. 
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Since no one of the sets y;,(x) for fixed yw is identically zero the integrals 
(26) for y=y are all different from zero. The characteristic functions y,,(x) 
may therefore be normed and orthogonalized as described in the theorem by 
a well known process.* 

4. Expansion theorems. With the help of the theorems of the last 
section it is possible to deduce some very general expansion theorems for 
sets of functions f;(x). The characteristic solutions y;,(x) of the definitely 
self-adjoint boundary value problem (16), appearing in these theorems, are 
supposed to be normed and orthogonal. 


THEOREM 13. For every solution f;(x) of a system of equations of the form 
(35) fi = Aiafa + Biaga, si(f) = 0, 
in which the functions g(x) are arbitrarily selected continuous functions on the 
interval ab, the series 
a 
converge uniformly and the functions Bia(fa—Ga) are all identically zero on ab. 


The uniform convergence of the series will be proved in a later section. 
The rest of the theorem follows at once from Theorem 11 since the equations 


f YawSas(fe — = 0 (u = 1,2,---) 


are immediate consequences of the definition of the functions ¢,. 


Corotrary 1. If the determinant |Bx(x)| is different from zero on the 
interval ab, as in the case when the matrix ||S x|| =||Tai Bax|| is positive definite 
at each point of ab, then for every set of functions f ;(x) having continuous deriva- 
tives on this interval and satisfying the boundary conditions s;(f)=0 the series 
(36) converge uniformly and represent the functions f ;(x). 


In this case the equations (35) determine uniquely a set of functions 
g(x) corresponding to the given functions f;(x), since the determinant |B | 
is different from zero. 


Corotiary 2. If the functions f;(x) are solutions of a system (35) and 
the functions g;(x) solutions of a similar system 


(37) =A t Baha, Ss(g) = 0, 


* See, for example, E. Schmidt, p. 4. 
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in which the functions h;(x) are continuous on ab, then the series (36) converge 
uniformly and represent the functions f (x) on this interval. 


To prove this the equations (35), (17), and (7) can first be used to show 
that for every characteristic solution y,,(x) of the system (15) the corre- 
sponding functions zi, =T jay a» satisfy the relations 


b b 
= f Yaw — Yar apfadx. 


b 


(38) 0 = Zay, a 


Since the functions g;(x) satisfy equations (37) it will be seen that the series 


b b 
Dor x) f YarSasfedt = yir(x) f YarS 


converges uniformly by the same proof as that which shows the convergence 
of the series (36). It follows readily with the help of the differential equations 
(16) for the functions y,,(x) that the series of derivatives of the terms of (36) 
converges uniformly and represents ¢{ (x). Further 


b 
(39) gi’ (x) = A iaPa + Bia f YarSapgedé . 


Since the functions Bia(fa—ga) vanish identically, (fa—¢a) Sas(fs—¢s) 
vanishes identically. Since from equations (35) and (39) the differences 
fa—¢q satisfy an equation of the type (23) it follows that they are identically 
zero, which was to be proved. 

5. Green’s matrix. Consider again the system 


(40) yi AiaYa; si(y) M iaVa(a) + NiaVa(b) =0 


described in § 1, and suppose that its determinant D is different from zero. 

Definition of the Green’s matrix.* A matrix of functions Gy(x, f), 
single-valued for a<x<b, a<£Sb, and further such that they are continuous 
and have continuous first derivatives in x except at x=&, is called the 
Green’s matrix of the system (16) if it has the properties 


(41) Gyx(é + = (555 = = 0 if t k), 
(42) Gulz,t) A ja(x)Gar(x,€), 

Ox 
(43) MiGar(a, + = 0. 


* The existence and uniqueness of the Green’s matrix has been proved by several writers. 
See Bounitzky, p. 77; Birkhoff-Langer, pp.66-70. 
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The properties (42) and (43) show that the columns of the matrix would 
be solutions of the system (16) of the kind demanded by the boundary value 
problem if it were not for their discontinuities at x=§ Let Du and Ax 
be defined by the equations 
(44) Dis = MiaVar(a) + NiaVar(d), Ase = MiaVar(a) — NiaYar(d). 


The determinant D=|D.| is different from zero by hypothesis. The func- 
tions 


are then well-defined and evidently have the properties (41) and (42). 
Equation (43) follows readily with the help of the notations (44). 


THEOREM 14. The functions H (x, £) = —Gyi(&, x) defined by the equations 
(45) are the elements of the Green’s matrix of the adjoint system (17), so that 


(46) Giix,x — 0) — Giilx,x + 0) = bes, 


(47) Gis(x,£) = — 


(48) PaiGra(x,a) + QaGka(x,b) = 0. 


By differentiating the equations Vig Ya =. it can be shown that the 
rows of the matrix Y%' are solutions of the differential equations of the 
adjoint system (17), and the properties (46) and (47) are then evident from 
the form of the equations (45). The relations M jaPar—N iaQazr =0 between 
the elements of the reciprocal matrices (6) justify the equations 


+ = Dial — + (0) | 


and equation (48) can be proved with the help of them. 


THEOREM 15. If the functions g;(x) are continuous on the interval ab then 
every solution y;(x) of the non-homogeneous system 


(49) yi = AiaVa + gi, si(y) =0 
is expressible in the form 


(50) = f 


and conversely every system y,(x) defined by equations (50) is a solution of (49). 


é 
2 
> 
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From the equations (49) for x =£, and equations (47), it follows that 


ae [Gra(x,£)ya(E)] = 
Hence 


b 


With the help of the relations (46), (48), si(y) =0, and (7), the sum of these 
two equations gives equation (50). 

The converse is easily proved by writing the expression (50) for y,(x) 
as the sum of integrals on the intervals ax and xb, differentiating with respect 
to x, and using the relations (46) and (42). 


Corotiary 1. The Green’s matrix is unique. 


For if there were two, say Gx and Ly, the equation 
— Leal = 0 
would be an identity in x for every set of functions g.(), and this can be 
true only if Gu = 


CorOLLARY 2. When \=0 is not a characteristic number then every solution 
ys(x) of the boundary value problem 


(51) yi = (Aia + ABia) Vi; si(y) = 0 


is a solution of the system of linear integral equations 
(52) =r 8) Basle 


and conversely. 


By replacing \ by \i1+A in the equations (16) it can readily be brought 
about that the value \ =0 is not a characteristic number, if this is not already 
the case. 


THEOREM 16. If the boundary value problem (16) is self-adjoint then the 
system (52) is equivalent to the corresponding system 


(53) a(x) f 


| 

i 

{ 

| 
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for the adjoint problem by means of the transformation 2;=T jaya. The functions 
Gu in this case satisfy the relations 


(54) GailE, x) Tax(é) 


Since the original boundary value problem and its adjoint are equivalent 
under the transformation z;=T ..V. it is evident that the systems (52) and 
(53) must have the same property. 

To prove equation (54) one may first deduce the second of the systems 


yi AiaVa + Ba; si(y) 0, 
A aita + t,(z) =0 


from the first by means of the transformation 2;=T j2ya. From Theorem 15 
it follows then that the systems 


yf) = f 


s(2) = f 


are equivalent by this transformation. But from these one verifies that the 
equation 


f [Hia(x,£)Tas(t) — = 0 


must hold for all sets of functions g.. Since H «, (x, £) = —G;;(£, x) this proves 
equations (54). 
If the elements of the kernel matrix of the system (52) are denoted by 
Ki,(x,&) = 


then this system and the one “associated” with it, according to the theory 
of integral equations, are 


b 
f 

(55) 
(2) = d f 


When the boundary value problem (51) is self-adjoint the relations (54), 
with (24), show that 


Kailé, %)Sax(€) Sia(x) K gx(x,€) ° 


/ 
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It is provable readily then that every solution y,(x) of the first of equations 
(55) goes into a solution u,(x) of the second by means of the transformation 
ui(x) =Sis(x) ye(x). If the determinant |S;,| is different from zero the equa- 
tions (55) are completely equivalent by means of this transformation. In 
that case a symmetric matrix U(x) can be determined such that 


U ia(x) (x) = S 


and the transformation 1;=U ay. takes the first system (55) into the 
system 


whose kernel matrix is readily seen to be symmetric. Such a reduction is 
not possible when the determinant |S | vanishes. 

It is evident that every solution of the system (53) defines a solution of 
(55) by means of the transformation u;=B,.z.. Conversely, if a solution 
u;(x) of the equations (55) is known, for a particular value of A, then the 
functions 


b 
a(x) =X f Gai(é, x)ua(E)dé 


satisfy the relations u;=B,,2. and the equations (53). If the boundary 
value problem (16) equivalent to the system (52) is definitely self-adjoint, 
every solution of the system (52) defines a solution of the conjugate system 
(55) by means of the symmetric transformation u;=S isyg=Bai Tas ye, as 
one may infer from the transformation 2; = T jg ys relating the solutions of the 
system (52) and its adjoint (53), or directly by means of the relations (54) 
and (24). 

It is clear then that the theory of a definitely self-adjoint boundary 
value problem (16) may be regarded as a special case of the theory of a 
system of linear integral equations (52) whose solutions go over into solutions 
of the conjugate system (55) by means of a symmetric transformation 
u;=Sig ys. From the preceding paragraphs it is evident that such systems 
have many properties analogous to those of systems whose kernel matrix 
is symmetric. The symmetric case is the one which arises when the matrix 
of functions Sy, is the identity matrix. It would be interesting to investigate 
in detail the theory of such systems of linear integral equations. 

6. The convergence proof. For the purpose of proving the uniform con- 
vergence of the series (36) of Section 3 a number of lemmas are required 


* See, for example, Bécher, Higher Algebra, p. 299. 
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which are analogous to those used for similar purposes in the theory of a 
single linear integral equation,* and which have for the most part been 
frequently applied in more generalized form than here given by E. H. 
Moore in his “general analysis.” 

It is understood that the functions S (x) are continuous and the quadra- 
tic form Sas fa fs20 at every x on the interval ab and for every set of argu- 
ments f;. The functions f;(x), ga(x), yis(x) in the following theorems are at 
least bounded and integrable on the interval ab, and the sets yy(x) (u= 
1, 2, - - -) are normed and orthogonal in the sense described in Theorem 10. 
The constants y, are the numbers 


The first lemma below follows at once from the readily proved equation 


(56) f [fa — — = f faSasfsdx — Divas 


where the sums without range indicated are taken with respect to pw, v over 
the same arbitrarily selected set of a finite number of positive integers. The 
second and third lemmas are immediate consequences of the first one. The 
integral in the second member of equation (56) is called the norm of the 


set f;(x). 
6 
LemMaA 1. < ff. Sasfadx. 
Lemma 2. yi converges. 


Lemma 3. [If the norm of the set g;(x) 1s zero then 


b 


Suppose now that the functions /;(x, £) are bounded for all values of 
x and & on the interval ab, and integrable in x on that interval for every 
fixed & Let 6, and e,(£) denote the integrals 


b 


* See E. Schmidt, pp. 1-4. 


ra 
& 


1926] A BOUNDARY VALUE PROBLEM 583 
Lemma 4. The series ))>-; 5,¢.(€) converges uniformly on the interval 
asésb. 


The proof is analogous to one given by Schmidt.* In the sum 
pta 
u=p 


let the index p range over those numbers for which 6,¢,(£) is positive for a 
fixed , and o over those for which these terms are negative. Then by 
Lemma 3 


( > 


where A is the maximum of the norm of the functions /,(x, £) on the interval 
a<é<b. A similar inequality holds for the negative terms, and it follows 
that 


2 1/2 


ptn 


Since the series of Lemma 2 converges it is evident that the similar series 
of terms 6; has the same property, and from the last inequality it follows that 
the series of Lemma 4 converges uniformly on the interval ab. 


THEOREM 17. If the functions f;(x) are solutions of a system of the form 
fi Aiafa + Biaga; si(f) = 0 
then the series ia(X) converges uniformly on the interval ab. 


From equations (38) and (52) it follows that the series of the theorem are 
also expressible in the form 


p=l Ay a 


b 


* Pp. 2-4. 


7 


| 
| } 
| 
| 
| 


584 G. A. BLISS 


For each fixed value of 7 this is a series of the form whose uniform convergence 
is stated in Lemma 4. 
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ON THE THEORY OF INTEGRAL EQUATIONS WITH 
DISCONTINUOUS KERNELS* 


BY 
RUDOLPH E. LANGER 


CHAPTER 1. INTRODUCTION 


This paper is concerned with the theory of the integral equation whose 
salient characteristic is that its kernel K(x, £) is discontinuous along the 
line =x. It is closely related with the theory of the integral equation char- 
acterized by discontinuities in the partial derivatives of the kernel for 
=x, of which more specific mention is made below. 

The point of departure for the consideration involved is in each case 
furnished by the familiar fact of the equivalence of a differential system 
composed of an equation 


d™u 
dx” 


po—— +--+ + pat = du, 


and homogeneous boundary conditions, with the integral equation 
= rf 


in which the kernel is the Green’s function of the reduced differential system. 
The existing theory of differential boundary problems guarantees the exist- 
ence of characteristic values of \ and corresponding functions u(x) for the 
differential system in a broad class of cases. The same is true, therefore, for 
the equivalent integral equation, and viewed from the standpoint of the 
theory of integral equations these facts must be attributable to the peculi- 
arities of the Green’s function which serves as a kernel. 

Fixing the attention for the moment on the case in which the differential 
system is of the second order and self-adjoint, the properties of the kernel 
which are of particular interest in this connection are (1), its symmetry in 
its arguments, and (2), the finite non-vanishing discontinuities in its first 
partial derivatives for =x. The extent to which symmetry of the kernel 
serves as a basis for a theory of integral equations is shown by the theory 
of Hilbert and Schmidt. The question of the extent to which the second 

* Presented to the Society, January 1, 1926; received by the editors January 21 and March 
26, 1926. 
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property mentioned serves as such a basis has remained open. It appears 
to have been touched upon first by Birkhoff in 1906 in an unpublished dis- 
cussion of the equation with kernel both symmetric and discontinuous.* 

The development of a theory based only on the discontinuity of the first 
partial derivatives of the kernel was taken up by Mrs. Eleanor P. Brownf 
in a thesis prepared under the direction of Birkhoff and presented at Rad- 
cliffe College in 1921. This theory will be presented shortly in a joint paper 
by Mrs. Brown and the author. 

If the differential system under consideration is of the first order the 
discontinuity along the line = occurs in the Green’s function itself. It is 
this property which is taken as the distinguishing feature of the kernel of the 
integral equation to which the present paper is devoted. While the analytic 
details which are of interest differ in many respects, the methods used here 
are in the main parallel to those of the paper referred to above which is to 
embody Mrs. Brown’s thesis. They are similar also in many respects to the 
methods employed by Birkhoff and the author in a treatment of differential 
boundary problems.{ 

The content of the paper may be roughly summarized as follows. In 
Chapter 2 the hypotheses on the kernel of the given equation are enunciated 
and a change of variables is deduced which serves to normalize the discon- 
tinuities in the kernel and its first partial derivatives. In Chapter 3 the 
integral equation is transformed into a system composed of an integro- 
differential equation and a boundary condition, and in Chapter 4 the equiva- 
lence of this system with the given equation is established. Chapters 5, 6, 
and 7 are concerned with the integro-differential equation alone (without 
the boundary condition), and it is shown that the equation is possessed of a 
solution for all values of the parameter which are sufficiently large. The 
asymptotic form of this solution in any right-hand or left-hand half-plane is 
there deduced. 

In Chapter 8 the boundary condition is introduced and by means of it 
the existence, under certain restrictions, of infinitely many characteristic 
values for the parameter is proved and the asymptotic form of these values 
is obtained. In Chapters 9, 10, and 11, the normalized asymptotic forms of 
the characteristic functions for the given integral equation and its associated 


* Bulletin of the American Mathematical Society, vol. 13 (1906), p. 62. 

¢ Formerly Miss Eleanor Pairman. 

~G. D. Birkhoff and R. E. Langer, Boundary problems and developments associated with a 
system of ordinary linear differential equations of the first order, Proceedings of the American 
Academy of Arts and Sciences, vol. 58, pp. 51-128; April, 1923 
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equation are developed, and it is proved that when the given equation has 
only simple characteristic values the set of characteristic functions is closed. 
Lastly in Chapter 12 the expansion of an arbitrary function in a series of the 
solutions is considered. It is shown that for the expansion of any function 
f(x) which is integrable in the sense of Lebesgue the solutions of the integral 
equation have essentially the same properties as the solutions of a related 
system composed of a differential equation of the first order and a homo- 
geneous boundary condition. At the end points of the interval, and only 
there, do the two expansions behave in dissimilar fashion. The related 
differential system in question here is a special case of a system previously 
studied by the author.* 

In the appendix a simple example of an integral equation to which the 
developed theory applies is given, and by computed results various features 
of the theory are illustrated. 


CHAPTER 2. THE NORMALIZATION OF THE EQUATION 


1. The given equation. We consider given the equation 


8 
(1) y(t) = p f T(t,7)9(x)dr, 


in which p is a complex parameter. The kernel is real and satisfies the 
following conditions: 

(i) that it is possessed of partial derivatives to those of order n21 in- 
clusive, these partial derivatives being continuous in the open regions 


RY 
a 


and approaching in each of these regions a finite limiting value at every point 
of the boundary 7 =#; 
(ii) that 


T= t+ 


| = g(t) 0. 


T=t— 


2. The derivatives of y(t). The differentiability of the function ¢(¢) 
defined in (ii) may be derived from the facts which we formulate as follows. 


* Developments associated with a boundary problem, these Transactions, vol. 25 (1923), pp. 
155-172. 
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Lemma. If F(t, 7) is any function possessed of the properties (i) above, then 
dF (t,t + ) 


(a) =Ft,t+)+F,(t,t+), 
dF (t,t — 
(b) — ) + — 


Let t2>¢; be any two points of the interval (a, 8). Then 


F( ts, te +) +) —F(ts,t:) + F(ti,t2) — F(ti,t: +)}, 


where At =/,—?,. By the law of the mean the right hand member of this equals 
— 0,At,te) + — 0<6<1. 


Since this approaches the right hand member of (a) as a limit when either 
t; or tg is taken at ¢ and At—0 the proof of statement (a) is complete. 

The proof of (b) is precisely similar. 

Since the kernel I'(¢, 7) and its partial derivatives to those of order 
(n—1) are of the type of F(é, 7) of the lemma, we may differentiate the right 
hand member of the relation 


n times. This, together with the continuity of the resulting terms on the 
right, establishes the existence and continuity of d"y(t)/di" on the interval 
(a, B). 

3. The change of variables. By hypothesis the function ¢(/) maintains 
its sign on (a, 8). With a proper distribution of the constant factors between 
p and LF it will follow that g(t)>0. We assume such a distribution. Then 
the continuous functions 


are monotonic increasing and may be used as new independent variables. 
Introducing them in equation (1) and denoting by f(x, &) the function into 
which f(t, 7) is transformed by the inverse of (2), we obtain the form 


b 


in which 
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8 
(4) h=—p, b= ff 
4 = 
5 It is readily verified that in the regions 


and on the boundary =, Q(x, &) is possessed of properties analogous to 
those under (i) above. Moreover 


O(x,£) =-—1. 
Defining ¥(x) now by the relation 
2.(x,é) = v(x) 


and introducing in (3) as a new dependent variable the function 


u(x) 


we obtain as the final form of the equation 


where 
K (x,t) = Ox, 
| This kernel K(x, &) and its partial derivatives are undefined on the line 


£=x. We shall complete their definitions by designating them to have in the 
points of this line their limiting values as these points are approached in the 
region Rj. 

4. The normalized equation. The equation (5) will be said to repre- 
sent the norma] form because of the following characteristics which its kernel 
is found to possess. 

(A) The function K(x, £) is possessed of partial derivatives to those of 
¥ order m21 inclusive, which are continuous in the region R, and in the 
closed region 


| 
| 
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(B) K(2,8) | —1; 


= 
f=2+ 


(c) | 0. 


It may be observed that the differentiation of (B) and the application of 
(C) yield the further relation 
©) | "=o. 
THEOREM 1. Every integral equation of the type (1) with a kernel possessed 
of the properties (i) and (ii) above may, by a suitable change of variables, be trans- 
formed into an equation of the same type with a kernel possessed of the properties 
(A), (B), (C) and (D). 


CHAPTER 3. TRANSFORMATION OF THE INTEGRAL EQUATION INTO 
AN INTEGRO-DIFFERENTIAL SYSTEM 


5. The auxiliary differential system. We consider in connection with the 
normalized equation the differential system 
(6) yy’ (x) =0, 

uy(a) + vy(b) = 0. 

The constants uw and vy are parameters, the choice of which is arbitrary, 
subject to restrictions to be imposed at certain points in the subsequent 
theory. In each case, however, only discrete values of the ratio w/v will be 
excluded. 

To begin, let u and v be chosen so that 


(7) ~ —1. 


Then system (6) is incompatible and possesses a Green’s function G(x, £). 
This function and its derivatives are undefined on the line §£=x where 
G(x, &) is discontinuous. We shall complete their definitions in the region R, 
composed of R; and Rj, by designating them to have in the points of the 
line =x their limiting values as these points are approached in the region 
R,. We may then enumerate their characteristics as follows. 

I. The functions G(x, £), G.(x, £), and G;(x, &) are continuous in R; and Ri ; 


Gz) | = 


G(x,t) = G,(x,t)= 0; 
uG(a,t) + vG(b,t)= 0; 
vG(x,a) + uG(x,b) =0; 


(October 
ITI. 
IV. 
Vv. 
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VI. the solution of the system 


= f(x), 


pw(a) + vw(b) = 0 
is given by 


b 
= f G(x,t)f(t)dt ; 


VII. G(x,f) = — H(é,x), 


where H(x, &) is the Green’s function of the system adjoint to (6). 

6. The construction of K (x, £). It will be observed that G(x, &) and the 
kernel K(x, ~) of the normalized equation (5), and their respective first 
partial derivatives, have the same discontinuities in the region R. We shall 
construct with the use of K(x, £) another function K(x, £) which maintains 
these characteristics, and in addition shares with G(x, &) its properties IV 
and V. 

We assume now that for the given equation 


(iii) n= 2, 
and that in the normalized form 
(iv) | K(a,b)| +|K(c,a +) + K(b,b)| +| K(b,0)| #0. 
Then yu and v may be so chosen, subject to previous restriction, that 
(8) A = p*K(a,b) + wv[K(a,a + ) + K(b,b)] + v?K(b,a) 0. 
We assume such a choice, and set 

vK(x,a) + wK(x,b) = W(x), - 

uK(a,£) + vK(b,£) = V(é). 


Clearly these functions W(x) and V(é) are possessed of continuous deriva- 
tives to those of order ” on the interval (a, b). The function K(x, &), defined 
by the formula 


(9) 


1 


(10) K(z,%) K(x,t) W(x) 


is, therefore, found to possess the properties (A) to (D) of the kernel K(x, &). 
From the relations 

uW(a) + »W(b) = A, 
vV(a) + uV(b) = A, 


(11) 


591 
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it follows further that 
(12) 


uK(a,t) + vK(b,t) = 0, 
vK(x,a) + wK(x,b)=0. 


7. The relation between K(x, £) and G(x, £). If x is regarded for the 
moment as a parameter, the function 


w(x,£) = K(x,&) — G(x,é) 
is continuous together with its first derivative in —. Since by III, V, and (12) 
it is also a solution of the differential system 


Ow(x, 


v(x, a) + poo( x , b) =0, 
it is uniquely determined as such, and is by VI expressible in terms of the 
Green’s function H(x, £). Introducing G(x, £) by virtue of VII, we may 
write therefore 


(13) K(2,t) —G(2,8) = — f Ki(x,t)G(t, 


This relation, if £ is now looked upon as the parameter, is in form an integral 
equation for G(x, £) as a function of x. 

8. The solvability of the relation between K(x,¢) and G(x, £). A sufficient 
condition for the solvability of equation (13) for G(x, &) is that the Fredholm 
determinant D for the kernel K;(x, £) shall differ from zero. This kernel, 
and hence also D, depends upon the parameters uw and v. We shall show that 
under the assumption, which we now make, that the given equation 
is such that 


(v) D # Oinywand 


it is always possible to choose these parameters subject to previous restric- 
tions, so that D0. 
We set 

K;(x,£) E(x, &) 

K (x,t) = 
and denote respectively by D and D(x, &) the Fredholm determinant and 
first minor of the kernel E(x, £), and by D and D(x, &) the corresponding 
expressions for the kernel €(x, ). We have* 

n=l n=l 


* Bécher, An Introduction to Integral Equations, Cambridge University Press, 1909, p. 32. 


4 
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where, if we omit writing the arguments and indicate them only by their 
subscripts, thus: E(é;,f;)=£;;, then 
a 
d, = cee «@ 
i n! dé, -- 
Ey Ei, n42 
E33 - Es,n+2 
dn = 
* 


We shall omit writing down the analogous formulas for D and D(éi, &), 
since they may be obtained by replacing in those above dn, d,(f:, 2) and 
E respectively by dn, dn(&1, 2) and €. 


Consider the term d,. By (10) we have 
E(x,£) = E(x,t) + o(x)¥(é), 


where 


W(x)V 
= 


Substituting this form and expanding the resulting determinant into a 
sum of determinants with monomial elements we obtain 


—1)" b b re 
d, = ( ) dg, dé, 
+ dt, - - -d&. 


| 
| 
| 
| 
] 
| 
| 
| 
| 
| 
| 
| 
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Of the terms on the right the first is simply d,. The remaining sum we shall 
reduce by the following manipulation. In the determinant of the 7th term 
of the sum, let the ith row and column be shifted into first places, and let 
the arguments be renamed as follows: 

£, to become &4:, for k=1, 2,---, (¢—1); 

&; to become ; 
the others to remain unchanged. Since these arguments are all variables of 
integration this change in their designation amounts merely to a change in 
the order of integration, and this is immaterial. The terms of the sum have 
thus been made identical, and hence the entire expression reduces to 


- 

n: a a 


Ene 


This we expand by the elements of the first column to obtain 


J 


Ese - Eon 


for 


Observing that the first term of this is 


f 


we proceed to a further reduction of the remaining sum. Let the (7—1)th 
column in the respective term (i.e., that in which 7 is the second subscript) 
be shifted into first place and let the arguments again be renamed so that 
£, becomes £41, fork=1,2,---,(j—1), 
£; becomes §&), 


* 
+ 
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and the others remain unchanged. This again makes the terms of the sum 
identical, and the expression takes the form 


This is precisely 


&2n 


b b 
-ff fe, £1) dE ides. 


Collecting our results, then, and summing the d, to obtain D, we find that 


1 


By use of Fredholm’s relation 
b 
E(x, £)D + f = D(x, 


and the formula 
= + vE(b, £1), 
which follows from (9), the final form 


1? 
D=D - =f +> 


is obtained. We observe now that in this relation D and D(x, £) are free 
from the parameters y and », since they depend only on €(x, £). It follows 
that D is rational in » and v. The condition D#0 in y, v is found directly 
to take the explicit form 


(va) | + |0(a,a +) + 0(b,b)| + | 0(b,a)| #0, 


where 


h 
(x,y) = K(x,9)D — f D(x,8)K(E, 


and D and D(z, £) are the Fredholm determinant and first minor of the kernel 
K(x, €). Except in the case that this inequality fails, the parameters » and 
v can be chosen so that D0. 


+ 
le, 
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The method of manipulation used on D can be employed equally well in 
the analysis of D(é,, 2). It may be shown in this way that D(£,, &) is also 
rational in yw and vp. 

With y» and » chosen so that D0, the existence of the kernel F(x, &) 
reciprocal to E(x, £) is assured. Writing the relation (13) then in “solved” 
form we have 


b 
(14) G(2,t) = K(x,8) — f 
9. Characteristics of F(x, £). Since the kernel E(x, £) is continuous in 
R, it follows that F(x, £) is likewise continuous in R. The behavior of the 
partial derivatives of F(x, £) may be determined as follows. Differentiating 
Fredholm’s identity 


b b 
(1s) E(x,8) + F(x, 8) = f E(x,1)F(t,8)dt = f F(2x,t)E(t,8)dt, 


we obtain 


b 
+ = f 


Ex(x,t) + = f F(x,t)Eq(t,£)dt. 


Since the right hand members of these relations are obviously continuous 
it follows that F,(x, £) and F;(«, £) are continuous in R, and R; , and hence 


that 


| | ’ 


t=—z 
f=2+ 


=~ | 


(16) 


In similar manner the higher partial derivatives may be considered. It will 
be found that F(x, £), like E(x, £), is possessed in Re and R; of continuous 
partial derivatives to those of order (n—1). 

From the formula 
D(x,£) 


F(x,&) D 


it is seen further that F(x, £) is rational in the parameters p and v. From 
Fredholm’s identity above and the relation 


(17a) pE(a,t) + vE(b,t)= 0, 
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obtained by differentiation from (12), we draw lastly that 
(17b) uF (a,t) + vF(b,t) = 0. 


10. The transformation of the integral equation. Let u(x) represent, 
now, any solution of the normalized equation (5). Multiplying (14) by 
du(é) and integrating with respect to £, we obtain 


6 
(18) af = f K(x,8)u()dé — f F(x,t){ ab. 


In this the integral which occurs in both terms on the right may be evaluated 
by multiplying (10) by Au(é), integrating, and utilizing (5). With the result 
so obtained and with the abbreviation 
(19) L(u) = wu(a) + vu(d), 
the equation (18) reduces to the form 
W(x) 

A 


J 


b 
(20) d f G(x,8)u(Q)de = u(x) — L(u) 


A 
We shall abbreviate this result by setting 


The function (x) so defined is continuous and has a continuous (m—1)th 
derivative on (a, 6). For subsequent use we observe that 


(22) + = 1. 


Introducing ®(x) into (20) we may write that relation 
3) ala) Gx, = + f Fe, 
By differentiation this yields finally the equation 
u'(x) — du(x) = L(u)o(x) + J 


where we have set 
g(x) = &(x), 


24 
f(x,&) = F(x, 8). 


597 
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Lastly adjoining to the equation thus derived a second equation— 
obviously satisfied by every solution of (5)—we may formulate the result 
as follows. 


THEOREM 2. Jf the kernel of the normalized integral equation (5) satisfies 
the conditions (iii),(iv),(v), above, then every solution of the equation is also 
a solution of the related integro-differential system 


(a) — \u(x) = L(u)e(x) + f 
(25) 
(b) Lid f V(t)u(t)dt. 


The functions V(t) and L(u) involved here are given respectively by (9) 
and (19) above. We remark that the function g(x) is continuous together 
with its derivatives to that of order (n—2) on (a, b), while f(x, £) is con- 
tinuous with its partial derivatives to those of order (n—2) in R: and Ri ‘ 

11. The associated integral equation. Deductions analogous to those 
above may also be made for the integral equation 


o(2) = f K(, 


associated with equation (5). To accomplish this most easily we shall 
set —K(é, x) = K(x, £) and write the equation in the form 


6) = f 


which is readily found to be its normal form. The deductions already made 
become applicable to equation (5), then, if \ is replaced by —X and if the 
functions involved are taken to spring from the kernel K(x, £) rather than 
from K(x, £). When this is the case we shall indicate it by superscribing the 
various functional symbols with a bar. 

The relation between the two developments is more easily followed 
if in passing from the treatment of the given equation to that of the associ- 
ated equation the parameters yw and »v are interchanged. We shall suppose 
this done. Since A = —A, then, the condition A+0 has already been met. 

It is, however, necessary to impose upon the kernel of the given equation 
the condition that 


(v) D +0 in » and uz. 
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The explicit form of this condition as it is obtained from (va) is that 


(va) | (a,b) | +| +, a) + 8(b,b)| + |8(b,0)| 0, 
where 
= K(x, D— 
and D and D(x, £) are the Fredholm determinant and first minor of the 
kernel — K;(&, x). 


When condition (¥) is satisfied, every solution of the equation (5) is 
found to be a solution of the system 


b 
+ do(2) = + f 


In this ° 
L(v) = vo(a) + yo(d), 


and to obtain the values of f(x, £) and g(x) in terms of familiar symbols we 
may proceed as follows. 
By direct substitution it is found that 


W(x) = — V(x), — WE), 
K(x,é)= K(é,x), E(x,t) = K;-(é, x). 


Substituting this value of E(x, £) in Fredholm’s identity 


b 
E(x,) + F(x,%) = f E(x,t)F(t,8)dt 


and differentiating with respect to x we obtain 
b 
— + — f Kult, 


In this we may replace K;,(£, x) by its equivalent E,(é, x). Then further 
integrating the right hand member of the equation by parts we find that 


b 
— Eeé,x) + f(x,t) = — E(b,x)F(b,£) + E(a,x)F(a,&) + f E(t, x)f(t,t)dt. 


599 
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Because of (17a) and the formula analogous to (17b), however, the sum of 
the first two terms on the right vanishes. Hence we have the relation 


— f(x,§) = — f f(t, E(t, x)dt. 


On the other hand the interchange of the arguments in (15) and the subse- 
quent differentiation of that relation yields 


E,(€,x) + f(é,x) = f f(E, x)dt. 


A comparison of the two results shows that 


f(x,t) = — 
With this result and the formulas already noted above we find readily 


b 
(26a) Al a) +f 
Lastly we observe that 
(26b) | 


= 12,8) | 


f—z— 


CHAPTER 4. THE EQUIVALENCE OF THE INTEGRO-DIFFERENTIAL 
SYSTEM AND THE INTEGRAL EQUATION 


12. The transformation of equation (25a). To establish the equivalence 
of system (25) and equation (5) it remains to show the converse of the 
theorem of the preceding section, namely that every solution of system (25) 
is also a solution of equation (5). To do this we shall retrace the steps taken 
above and so deduce the equation (5) from the system. 

Consider first the differential system 


y'(x) = o(x), 

wy(a) + vy(b) = 1. 
Because of (7) it is possessed of a unique continuous solution. By (22) and 
(24), however, (x) is such a solution. Hence the system serves to determine 
#(z) uniquely in terms of g(x). In similar fashion because of (17b) and (24) 


the system 
x , £) 
= f(x,8), 
Ox 


pw(a, + €) =0 


serves to determine F(x, £) uniquely in terms of f(x, &). 


(27) 


q 
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Let u(x) now be any solution of equation (25a), and with the functions 
@(x) and F(x, ~) determined construct 


b b 
o(2) = — f G(x,8)u(2)dt — f F(x, 8)u()dt — L(u)®(2). 


This function is obviously continuous. Moreover, it is a solution of system 
(6), for we find on the one hand by differentiation the relation 


b 
o'(2) = — — f — L(u)o(2), 


of which the right hand member vanishes by (25a), while it follows on the 
other hand from property IV of G(x, £) and from (17b) and (22) that 


po(a) + vo(b) = 


The system (6) is incompatible, however. Hence o(x)=0, that is, u(x) 
satisfies equation (23). 

The function F(x, £) was originally derived in Chapter 3 as a reciprocal 
kernel. Hence it is itself possessed of a reciprocal, which is, moreover, pre- 
cisely E(x, £) of the preceding chapter. The integral equations 


b 
6(x) = + f F(x,t)0(t)dt, 
(28) 
= G(x,£) + f F(x,t)o(t,£)dt 


are, therefore, uniquely solvable, and since by (21) and (14) we have as 
solutions respectively W(x)/A and K(x, &), these functions are uniquely 
determinable. We have at our disposal, therefore, the relations (14) and (21) 
satisfied by them, while the substitution of (21) in (23) yields further the 


relation (20). 
Let (14) be written, now, in “solved” form, thus: 


6 
From this we obtain upon multiplying it by \w(£) and integrating it with 
respect 


« 
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We shall eliminate from this the quantity 
b 
vf 


by substituting for it its value as given by (20). In this way we find, upon 
collecting terms, that 


x) 


K(x, = u(x) — L(u) 


This reduces, however, for by Fredholm’s identity (15) the first factor in the 
integrand on the right vanishes. It follows that every solution of (25a) 
satisfies also the equation 


(29) u(2) = f + L(u) 


13. Application of equation (25b). If the solution u(x) of equation (25a) 
satisfies also (25b), the value of Z(u) may be eliminated between (25b) 
and (29) above. The result obtained is that 


W(x)V(E 
(30) u(x) = f + 


and since the bracket in the integrand is, by (10), A(x, &), this is precisely 
equation (5). This result we formulate as follows. 


THEOREM 3. Under the hypotheses of Theorem 2 every solution of the integro- 
differential system (25) is also a solution of the related normalized integral 
equation (5). 


The proof of the equivalence of system (25) and equation (5) is thus 
completed. 

14. Generalization. In the deductions of this chapter we have been 
concerned hitherto with the particular system (25) derived in Chapter 3 
from equation (5), and because of this we were able to identify as previously 
known functions the solutions of the various defining differential systems and 
integral equations. In any case, however, we see from equation (30) and the 
fact that W(x)/A and K(x, &) there involved are solutions of the respective 
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equations (28), that the kernel of equation (30) is itself given as the solution 
of the integral equation 


(31) K(x,&) = [G(x,t) + (x) +f F(x, OK (t,é)dt. 


It is clear from this that the method may be applied to the transformation 
of the general system of type (25), provided that the function F(x, £) 
defined as above is possessed of a reciprocal. We may state therefore the 
following theorem. 


THEOREM 4. If an inlegro-differential system of type (25) satisfies the 
condition that the solution of its related differential system (27) is possessed of 
a reciprocal, then every solution of the integro-differential system solves also 
a related linear integral equation of the second kind. 


15. Theassociated system. Clearly all the methods employed above 
are applicable to the system (25) as well as to (25). Hence we may conclude 
that system (25) is equivalent to the equation (5). 


CHAPTER 5. ‘THE EXISTENCE OF A SOLUTION OF THE INTEGRO-DIFFER- 
ENTIAL EQUATION FOR LARGE VALUES OF THE PARAMETER 


16. Matters of notation. For the sake of simplicity in the formulations 
and deductions of this and subsequent chapters we shall make here certain 
conventions of notation. To begin with, the relation |A|>N shall be inter- 
preted as an abbreviation of the statement “‘|A| sufficiently large.” It is to 
be understood, therefore, that N does not necessarily mean the same con- 
stant in any one case as it does in any other. Further we reserve B(x, &, d) 
as a generic symbol for functions which for (x, £) in the region R and |\|>N 
are possessed of the following properties: 

(a) B(x, &, X) is uniformly bounded, i.e. 

|B(x, )| < B (a constant); 

(b) B(x, &, X) is integrable in £ uniformly in x and X, that is, if the interval 
(a, b) is subdivided by the points §:=a<,<i.< -- - <&,=bin any manner 
so that |f;—£,_1| approaches zero as n—, and if U,(x, \) and L,(x, d) are 
respectively the upper and lower bounds of B(x, ~, \) on the sub-interval 
(&:-1, &;), then when any constant is prescribed there exists a numbe 
such that 


| — — | < 


for n=n,. 
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Lastly o and ir will be used to designate the real and pure imaginary parts 
of A, thus: \=o+7r, and a constant designated M shall be understood to be 


positive or zero. 
17. Lemmas. For use in the subsequent deductions we begin by estab- 


lishing the following lemmas. 
Lemma 1. If aSzSb and 


I(x,2,d) = 


then 


uniformly in x and z and the argument of d. 


To establish this let the interval (a, b) be subdivided in the manner de- 
scribed under (b) above, the point z being taken as one of the points of 
subdivision, ie. =z. Then if £; is any point of the interval (£,1, &;) we 
have for £ on this interval 


| B(x ,&,d) | < Ui(x,d) Li(x,d). 


Let I(x, , \) be written now in the form 


Then since : 


foro SM, aStsesbd, 


we have 


[U(%,d) — Li(x,d)] 


oa) ‘ 
Sy 
|r| int 


— Lix,d)}(& — 1). 


= 
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Now let e>0 be arbitrarily chosen. Then since k <n the final sum will, by 
hypothesis (b), be less than e for m2>m,, and we have 


| I(x,z,d) | < ~ 
Since m is fixed when e is chosen, we may by making |A| sufficiently large 


reduce the right hand member of the inequality to less than 2e“°~%e. 
This proves the lemma. 


Lemma 2. If 
f 
b 
then 
lim, I, =0 


uniformly in x and z and the argument of d. 


The proof of this lemma is in every way similar to that of Lemma 1. 
18. Transformation of the integro-differential equation. _ We consider 
now the integro-differential equation 


(32) — u(x) = L(u)o(x) + f + E(x, ). 


The symbol E(x, d) is to be considered generical, designating a uniformly 
bounded function which is analytic in \ for |A|>N. The equation (32) 
reduces to equation (25a) in the special case E(x, 4)=0. For subsequent 
applications it is essential to consider the more general case. 

If we consider equation (32) for the moment as a non-homogeneous 
equation of the type 


u' + pu =q, 


it follows from the theory of differential equations that it may be written 
in the form 


z 
(320) u(2) = + f + f + 


the limit + being any constant on the interval (a, 0), and the coefficient ¢ 
being constant with respect to x but otherwise arbitrary. 

For purposes of orientation we shall proceed to deduce the consequences 
of the assumption that for such choices of the constants as will be made, 
equation (32a) is possessed of a solution. This deduction will be made 
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separately for the cases in which \ is confined to a left hand half-plane 
o <M, and in which ) is confined to a right hand half-plane o=2 —M. 

19. Heuristic deductions when o<M. Let the arbitrary elements in 
(32a) be chosen in this case as follows: 


*=da, c=ce™, 


where ¢c, is any absolute constant. The equation becomes, then, 


z 6 
(32b) u(x) = +f +f S(t, + a) at 


Substituting the right hand member of this for u(x) in the expression L(), 
we obtain, upon solving for L(x), 


(33) L(u) = + f at, 


C= E rf 


This solution is possible if |A|>N. For, since g(€) is a function of the type 
denoted by B(x, &, \) the integral in the expression for C decreases to zero 
as |A| increases, by Lemma 1. Hence C0 for |A| >N. 

With the value of Z(u) thus obtained equation (32b) takes the form 


where 


b 
(32c) u(x) = (x,d) + f 
where 
O(x,r) = +f E(t ,d)dt 


( z 


and 


Ox, = f par + f f(t, f 


The assumed solution of equation (32b) must, therefore, satisfy also equa- 
tion (32c). 

20. The existence of a solution of equation (32) for o<M. Returning 
now to equation (32a) we may construct, by the respective formulas above, 
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the constant C, the functions 0(x, \) and Q(x, &, A), the equation (32c), and 
the series 


b 
(34) + f 


b 
+ f + 


In the formulas for 6(x, A) and Q(x, &, A) the integrals involving E(é, d) 
are uniformly bounded. The remaining integrals may be made arbitrarily 
small by taking lr| >WN, since f(x, &), like g(€), is a function of the type 
B(x, &,X). Lastly e i; bounded, since Hence for >N 


| <A (a constant ), 


1 
| 


and the terms of the series (34) are in absolute value less than the corre- 
sponding terms of the series 

A A 

2 2? 
The series (34) converges, therefore, uniformly to a function numerically 
less than 2A. Since the terms of (34) are continuous in x and analytic in A 
it follows that (34) converges to a function u(x, \) which is uniformly 
bounded, continuous in x and analytic in A. On the other hand by the classi- 
cal theory of integral equations* this function u(x, \) is a solution of (32c). 

We observe that if E(x, \)=0, then the choice ¢,=0 leads to @(x, 4) =0. 
The solution found is in this case u(x, \)=0. We shall suppose, therefore, 
in proceeding, that c; has been so chosen that |c,|+|E(«, \)| 0. 

Let the right hand member of (32c) be substituted now for u(x) in the 
expression L(u). The result is found to be precisely (33) above. The right 
hand member of (33) is contained, however, in the right hand member of 
(32c). The elimination of it between the two equations yields equation 
(32b), which is, therefore, satisfied by the function u(x, \) found. Lastly 
differentiating (32b) we obtain equation (32), which completes the proof 
that for ¢<M, |A|>N, equation (32) is possessed of a solution which is 
uniformly bounded, continuous in x, and analytic in X. 


*Bocher, loc. cit., p. 15. 
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21. The case s=>—M. When d is to be confined to a right hand half 
plane c= —WM the arbitrary elements in (32a) may be chosen as follows: 


= 


where cz is any absolute constant. Reasoning precisely analogous to that 
followed in § 19 and § 20 serves, then, to establish the fact that for c7={ —M 
and |A|>N equation (32) is also possessed of a solution with the char- 
acteristics noted above. 

22. The associated equation. The discussion of the equation obtained 
from (25a) by adding on the right a function E(x, \) may be carried through 
precisely like the discussion just concluded. In this way it is found that for 
\ confined to any half-plane ¢ < M or ¢= —M, and |A|>N, such an equation 
also is possessed of a uniformly bounded solution which is analytic in A 
and continuous in x. 

Summarizing the results of this chapter we have obtained the following 
theorem. 


THEOREM 5. If the complex parameter d is restricted to a region bounded 
by a line parallel to the axis of imaginaries and exterior to a circle sufficiently 
large and with center at \=0, then the integro-differential equation (32) admits 
of a solution which is uniformly bounded, continuous in x, and analytic in X. 


CHAPTER 6. THE FORMAL SOLUTION OF THE INTEGRO-DIFFERENTIAL 
EQUATION 


23. Alemma. In order to preclude interruption of the deductions about 
to be made we begin by establishing the following lemma. The symbol 
H (x, &, ) will be used to designate a function which is merely bounded 
uniformly for |\| >, and is integrable in x and &. 


Lemma 3. If 2; and 22 are any points of the interval (a, b), and 


= f "'H(&, t,d)dt, 


then the functional form of this integral is given by 
I(21,22,€,2) = eH (z2,£,2) + eH (21,£,2). 
This lemma follows almost immediately from the preceding ones. Thus 


the integral may be written in the form 


1(21,22,£,) anf e~Ma-O A (Et, r)dt, 
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where * denotes any constant on the interval (a, b). If -c<M we may 
choose *=a. Then the factors of the integrands on the right are all uni- 
formly bounded. The same is therefore obviously true of the integrals 
themselves. On the other hand if —c=—M we may choose *=), with the 
result that the integrals are again uniformly bounded. This proves the 
lemma. 

24. Heuristic investigation of the functional form of a solution of (25a). 
The existence of a solution of (25a) has been shown. We shall suppose 
now for purposes of orientation that such a solution may be obtained by the 
method of successive approximations, and in particular by the approximating 
scheme defined by the formula 


b 
(35) — dAus(x) = +f f(x, 


This formula is in the form of a differential equation for u;(x) when u;~1(x) 
is known. 

With the initial approximating function ~,=0 we have as a possible 
choice u;=e™. Substituting this in the right hand member of (35) and 
applying Lemma 3 we find that u2(x) is given by an equation of the type 


(35a) to’ (x) — = e*H(x,r) + e*H(x,d). 


The formula 


yields a solution when the * is chosen arbitrarily for each term of the inte- 
grand. We obtain a particular solution 2(x), then, by choosing +=) for 
the terms of which e” is a factor, and «=a for the terms which contain e” . 
By Lemma (3), then 


(36) uo(x) = eH (x,d) + e*H(x,d) + e*H(x,d). 


Upon the substitution of this as u;-:() in (35) and the application of 
Lemma 3 we find that ~;(x) satisfies an equation of precisely the type (35a). 
Hence there exists a “3(x) of the same functional character as “2(x) given by 
(36). Obviously the same argument may be applied in the course of each 
successive substitution, and since we may consequently choose “,(x) for 
every 1 of the form of “2(x) above, the convergence of the procedure would 
lead to a solution of equation (35a) of the form (36). Guided by this result 
we proceed to substitute the form (36) into the equation with the purpose of 
determining as far as possible the functions H(z, \). 
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25. The formal determination of coefficients. The undetermined func- 
tions H(x, \) are bounded for |A|>N. We shall proceed on the hypothesis 
that they are expansible in power series in 1/A, and shall write the assumed 
form of the solution of (25a) 


(37) u(x) = e*[y(x)] + e*[8(x)] + &*[a(x)]. 


In the notation used here 


= 


The result of the substitution of form (37) in the equation (25a) involves 
the integral 


6 
I= f 8) 


We shall consider this in two parts, namely 


n=f (2,2) 


k=O 


k=on—1 


Integrating the kth term of J, by parts (n—2—k) times, we obtain the for- 
mula 


n-3 — OS fré — efré(x,0) 


k=O iO 


— 


6 n—2 
a k=0 


The notation has been abbreviated here so that 


f(x, 


| = 


O*fy 


= fré (x,&). 


and 
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The substitution of (37) in the equation yields therefore 


— — — f 


(38) 


1 n—2 b n—2 


k=O 


— 
+f = 0, 


where the coefficients of e* and e” are of the same general character as that 
of 

Equating now to zero the coefficient of e*/X' for /=0, 1, 2,---, (n—2) 
respectively we find that yo’=0, whence we may choose yo=1, and that the 
functions 7:(x) y2(x), -- +, Yn-2(x) can be successively determined each by 
means of a quadrature. Continuing in similar fashion and equating to zero the 
coefficients of and for 1= —1, 0, 1,--- , (w—3), respectively, 
we find likewise that 


Bo = 0, 
Bi(x) = — ve(x), 


aga= 


= — pe(x), 


and that B2,--+ , , may also be successively evaluated. 
Hence we may construct the function #(x) given by the formula 
a(x) = e* ‘ > 
(40) 


26. The equation satisfied by a(x). The deductions above have been 
purely formal. A consideration of formulas involved shows, however, that 


(39) 
etc. 
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since g(x) is possessed of derivatives of order (x —2), while f(x, &) is possessed 
of partial derivatives of the same order, y;(x) may be differentiated (—7) 
times, while 6;(x) and a,(x) are differentiable (n—1—7) times. The function 
ti(x) determined is, therefore, differentiable and can be substituted into equa- 
tion (25a). 

The result of this substitution may be read directly from (38) above if the 
coefficients with subscripts greater than (n—2) are taken now to be zero, 
and the symbols [y(x)] etc. are interpreted as representing the respective 
polynomials instead of the formal infinite series. The coefficients have been 
explicitly determined so that the terms in e*/\‘, e/d*-!, and e*/A‘-! vanish 
fori=0,1,---, (w—2). Since the integral J, of (38) does not now appear, 
there remain merely a term of the form B(x)e”/A"-*, an analogous term in 
e=/x"-2, and the integral 


( om 1)"-? b n—2 

a k=O 

Writing this integral in the form used in the proof of Lemma 3 we find readily 

that the substitution yields the result 


(41) 
{e**B(x,d) + eB(x,d)}, 
where, moreover, the functions B(x, \) are analytic in X. We shall call the 
function a(x) a formal solution to the order (7—2) of equation (25a), since 
for such values of \ for which the exponential factors in (41) are bounded the 
equations (41) and (25a) agree to terms of degree (n—2) in (1/d). 

27. The associated equation. Equation (25a) differs from equation 
(25a) in that each function involved is replaced by the corresponding function 
superscribed with a bar, and in that \ is replaced by —A. With these altera- 
tions formula (40) above yields the formal solution of (25a). Because of 
relation (26b), therefore, this formal solution is 
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the function y:(x) occurring here being that which occurs also in formula 
(40). 


CHAPTER 7. THE SOLUTION OF THE INTEGRO-DIFFERENTIAL EQUATION 


28. Relation of the formal solution to a true solution fors<M. By the 
conclusions of § 20 equation (25a) admits, for ¢<M, |A|>N, of a solution 
u(x) which satisfies the relation (32b) with c, any non-vanishing constant, 
and E=0. Such a solution will not vanish for x =a, and hence upon being 
multiplied by a suitable function of \ alone, will satisfy the condition 


u(a) = a(a), 


where a(x) is the formal solution (40). This multiplication by a function 
independent of x is permissible since the equation (25a) is homogeneous. 
Inasmuch as the solution of (32b) originally chosen is analytic in for 
|\| >, while the same is true of the formal solution #(x), the analyticity 
of the final form (x) is assured. 

Consider now the function 


(42) w(x) = u(x) — a(x)}. 


The bracket on the right obviously satisfies equation (41). Hence w(x) is a 
solution of an equation 


w!(2) — dw(2) = L(w)e(2) + f + + B(z,d)}, 


the functions B(x, \) being analytic in X. This equation is of the form (32) 
since for o<M the bracket on the right is of the type E(x, \), and hence we 
may apply to it the deductions of § 19 and § 20. Since w(a)=0 by (42), 
we find from the form (32b) that we have before us a case in which the 
coefficient ¢, of § 19 is zero. The function @(x, \) reduces, therefore, in this 
case to 


@(x,d) = f { B(t,d) }dt 


b s 
f { + B(t,d) }dt f 
and hence is, by Lemma 1, of the form ¢(x, \), where this symbol is used to 
denote a function which approaches zero uniformly in x as |A| >. In 
consequence of this, as may be drawn from § 20, w(z) is itself of the form 
€(x, A). 


: 
£ 

| 

| 

4 
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Solving (42), then, for u(x) we obtain the result that there exists for 
<M, >N, asolution of the form 


e(x,A)e™* 


u(x) = a(x) + 


This solution is moreover continuous in x and analytic in d. 

29. The solution for c= —M. The deductions of § 21 establish the exis- 
tence of a solution of equation (25a) which for ¢= —M, |A| >N, is analytic in 
X, continuous in x, and non-vanishing atx=b. Because of the homogeneity 
of (25a) this solution may further be made to satisfy the condition 


u(b) = 2(d). 


We may conclude, then, in a manner entirely analogous to that of § 28, 
that the function 


a(x) = u(x) — a(x) } 


satisfies an equation of the type (32) and, since it vanishes at x=), is of the 
form ¢e(x, \). From this it follows further in the manner of § 28 that there 
exists a solution of equation (25a) which, for |A|>N, ¢2 —M, is analytic 
in \ and continuous in x, and which is of the form 


Inasmuch as zero is a particular function of the type e(x, A) the results 
above may be summarized as follows. 


THEOREM 6. If the complex parameter d is restricted to a region bounded 
by a line parallel to the axis of imaginaries and exterior to a circle sufficiently 
large and with center at X=0, then the inlegro-differential equation (25a) 
admits of a solution of the form 


u(x) = {1 + 


ref 


This solution is analytic in \ and continuous in x. 


e(x,A)er” 
u(x) = a(x) + - 
7-2 
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30. The associated equation. Clearly the analysis applied above to the 
equation (25a) and its formal solution (40), may be applied equally well to 
the associated equation (25a) and its formal solution (40). In this manner 
it is found that when d is confined to any right or left-hand half-plane and 
\\| >, the equation (25a) admits of a solution v(x) which is continuous in 
x, analytic in A, and of the form 


Bn—2(x) 
n—2 


a@,—2(x) e(x,d) 

We remark lastly that since the coefficients and functions involved in 
equation (25a) and (25a) are all real, conjugate imaginary values of will 
correspond to the conjugate imaginary solutions u(x) or v(x). Thus if we 
denote by X., u.(x), and v(x) the conjugates of \, u(x), and v(x) respectively, 


u(x,A.) = u-(x,d), 
v(x,Ac) = 


CHAPTER 8. THE CHARACTERISTIC VALUES 


31. The characteristic equation. The immediately preceding sections 
have been concerned with the form of a solution of equation (25a). By 
Chapters 3 and 4 such a solution solves the given normalized integral equa- 
tion if and only if equation (25b) is also satisfied. This imposes a restriction 
on the choice of \ which we proceed to consider. 

Substituting for the factor \u(t) in (25b) its value as the second term of 
(25a) we obtain the equation 


L(u) -f — L(u)e(t) — f sit, ay, 


which, if we integrate by parts the term involving «’(t), takes the form 


f A - 


x) Yn-2(X 

v(x) = _ ni) +---+(-—1)* 

X 
a B,(x) 
il 
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In this, A-¢(¢) is given by formula (26a), and 


= V(a) + f 
(44a) 
bs = V(b) + f 


It is convenient to set 


(44b) 


and to write the equation in the form 


(45) a -f w(0) - f | =0. 


We shall call this equation the characteristic equation. Its roots, if such 
exist, are values of \ for which the solution of the integro-differential equa- 
tion satisfies also the given integral equation. We shall call these roots 
characteristic values and denote them by ),. 

We observe that since the coefficients of equation (45) are all real, the 
equation is essentially unchanged if u(x) is replaced by its conjugate imagin- 
ary u.(x). From this it follows, by § 30, that the characteristic values occur 
in conjugate pairs. 

Now for ) in any right or left hand half-plane we have by Theorem 6 a 
solution 


u(x) = e* + + 


which is analytic in \ for |\|>N. The substitution of this in (45) yields for 
the characteristic equation the form 


(46) [ki + €(d) Je** — [ke + €(A) = 0, 


in which the functions ¢(A) are analytic for |A| >N. 

32. A further condition on the given equation. It is essential to the pro- 
cedure which is to be followed in solving equation (46) that neither of the 
coefficients k; and k» shall vanish. Since k; and k2 are seen from the formulas 
to be rational functions of u and v we may choose u and » subject to previous 
restrictions so that k,k20, provided only that neither k; nor kz vanishes 


[October 
_ 
2,(é) = 
_ 
= 
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identically in » and v. We shall suppose then that the given integral equation 
is such that 


(vi) ki(u,v) 0, ko(uv) 0. 
From the formulas for k; and kz it follows immediately that 
(47) vki(u,v) + wke(u,v) = A(u,r). 


It is a clear from this that a sufficient condition for (vi) is that 


A(0,v) #0, D(0,v) # 0, 


and 
A(u,0) # 0, D(u,0) # 0. 


For if these conditions are fulfilled k; and ke are defined for the values (u, 0) 
and (0, v) and if either one vanished identically relation (47) would entail 
a contradiction for or for »v=0. 

The restriction (vi) is an essential one, for we have in the Volterra equa- 
tions, i.e. those in which K(x, )=0 in one of the regions R; and Ro», an ex- 
ample of equations which may satisfy all previous conditions, but to which 
the subsequent results do not apply. To show that these equations are 
tuled out by condition (vi) we may proceed as follows. 

Let us suppose K(x, £)=0 for —>x. Then W(a)=v and v0 because 
of (8). We recall that E(x, £)=K;(x, £), and hence that Fredholm’s identity 
(15) yields the formula 


+ F(a,8) = f 


Upon an integration by parts the right hand member of this assumes the 
form 
6 


of which the first two terms reduce because of (17b) and (12) to F(a, &). 
Hence we may write 


b 
f 


or, upon substituting from relation (10), 


= 


6 
+ f K(a,bat | 


A 7 W(a)L 


ig 
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Multiplying this by W (é) and integrating we obtain the form 


W 1p b 
frof F(t - f Kel, 


— — + f a. 


With the use of formulas (11) and a rearrangement of terms this leads finally 
to the relation 
V(b) — W 


v 


1 ‘ 
@| + f F(t, (a, at 


From this form it is readily found, now, that since K(x, £)=0 for &>zx, the 
second term on the right vanishes, with the result that 
ki + vK(b,2), ke = 0. 
33. The solution of the characteristic equation. We consider to begin with 
the related, simpler equation 
(48) kye** — keer? = 0. 
Because of (vi) we may suppose uw and v so chosen that k,;~0, k.~0, and 


the equation is easily solvable. Thus e**-® =k,/k2, and by taking logarithms 
the roots are found to be 


1 k 
(49) pm = + log =} (m= 0, +1, +2,---). 
b ke 
Solving the equation (46) now formally in the same manner we find 
ky + 


ke + €(AXm)! 


1 
+ log 
b-—a 
Then inasmuch as in functional form 


ky + €(Am) ky 
og = log ke + €(Am), 
we have 
(46a) Am = Pm + €(Am). 
Consider now the function 


[om + e(A) J, 


[October 
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the function « being that of (46a). The function @(A) is analytic, and if 
§>0 be chosen arbitrarily but sufficiently small it will follow that |e(A)| <6 
for |A|>N. If, therefore, a circle C,, with radius 6 be drawn about each of 
the points p,,, then for |m| sufficiently large the point pm+e(A) will lie within 
the corresponding circle C,, whenever is a point on the circumference. As 
d describes this circumference, therefore, the argument of 6(A) increases 
by 27, from which it follows that @(A) =0 has precisely one root within Cp. 
This means if we denote this root by \,, that the points A,, are represented 
asymptotically by the points p», namely 


(50) An = Pm + Em, 


where €2, -, €m, is a sequence of constants such that limm.e €m=0. 
Moreover since the characteristic values occur in conjugate pairs, while p,, 
and p_» are conjugate,* we conclude that 


From the significance of p,, for the given integral equation, and from the 
fact that by (49) p, involves the ratio k:/ks, it follows that this ratio is 
determined by the integral equation alone and so must be independentt 
of the parameters y and ». 

34. A more precise formula for\,,. In deducing the formula (50) we 
assumed only that for the given integral equation »22. If we have at our 
disposal the existence of derivatives of higher order a formula more precise 
than (50) may be deduced. We shall assume, now, for the following deduc- 
tions that for the given integral equation 


(vii) n= 4. 
The assumption (vii) insures the existence of a solution u(x) of equation 
(25a) which is of the form given by (43) form=4. Substituting this solution 


in (45) and integrating by parts the integrals involving ef Q(£) we obtain for 
the characteristic equation the form 


Ri: k ke k 


? 


* It is assumed at this point that k:/ke is positive. If it is negative we may write log (4:/k2) = 
zi + log |ki/ks|, in which case it is evident that p» and p_m—: and hence also A» and A_m—1 are con- 
jugate. To avoid unnecessary complications we shall continue only with the case k;/ke>0, the 
modifications for ki/ke<0 being reasonably obvious. 

t The example of the appendix illustrates this. 


| 
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where the coefficients & are constants and in particular 


b 
kin = + + 22,(a) — 2f — 


(S1) 

Solving this equation formally in the fashion of § 33 we obtain 

kiz his + €(Am) 
Ridm 

han + €(Am) 
1 

kodm 
or, upon expanding the logarithm as we may, because of (50), for |m| 
sufficiently large, the form 


i+ 


log 


Am = Pm + 
b-—a 

A’ representing a constant independent of m. From the formulas (46a) 
and (49) we find readily, however, that 

1 b-—a A”+€(m) 

Am m? 
A” being a constant. If this is substituted in the right hand member of the 
preceding expression we obtain a result which is expressed in the second 
part of the following theorem. 


THEOREM 7. If the kernel of the normalized integral equation (5) satisfies 
the conditions (iii), (iv), (v) and (vi) then there exist infinitely many values of 
the parameter for which the integral equation admits of a solution. These 
characteristic values are of the form 

1 k 
An = {2mri + log} + (m=0,+1,+2,---), 
b-—a ke 
where 
lim = 0. 
+o 

If the kernel of the integral equation satisfies the conditions (iv), (v), (vi) 
and (vii), these characteristic values may be determined more precisely by the 
formula 


k ki2/k1) — 
(50a) A, = + log ah 12/ ki) — +e 


2mri 


(October 
Ris koe 
1 ky k A’ +€(Am 
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The various constants k are given by formulas (44a) and (51), and A also isa 
constant independent of m. 


Obviously if a larger number of derivatives of the kernel of the given 
equation are at hand the method applied above is applicable to the deduction 
of a formula still more precise than (50a). 

35. The associated equation. It is not necessary in this case to repeat 
the discussion for the system (25). By the classical theory of integral equa- 
tions the associated equations (5) and (5) have the same characteristic 
values. Hence formula (50a) serves both. 


CHAPTER 9. THE CHARACTERISTIC FUNCTIONS 


36. The solutions of the integral equation. The characteristic values 
\ by their very determination are such that the functions u(x, \,,) satisfy 
not merely equation (25a) but the entire system (25). As such these functions 
are also solutions of the given normalized integral equation. For brevity 
we shall denote them by ~,,(x), and shall call them characteristic functions. 

The form of the characteristic function “,,(x) is obviously obtained by 
substituting the value \=X,, in the form u(x) given by formula (43). Divid- 
ing this formula by e“ and observing that for |m| sufficiently large e=¢—~ 
is uniformly bounded, we obtain because of (vii) the formula 


Bi(x) ai (x) 4 H(x,Xm) 
Am Am Aw 


thm( x) = + + 


Since by (50a) 


em(z-a) — (1/ 2mxj) ky) — ke) (m) | me] 


x—a@f/ki2 kee H(x,m) 
ky ke m? 


we find in terms of p,, and m 


(52) thm(x) = {1 + + x) B(s,m) 
m m2 
where 
— a) + ka) (x — @) 
(53a) Qx(x) = 
and 


? 


3 i 
i 
¥ 
4 
f 

{ 
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By (48), however, 


ky 
ePm(o-a) = —. 


ke 
Hence the function Q.2(x) is independent of m and substituting for 8;(zx) 
and a(x) their values from formulas (39) we find that 


— (b— a) ky 
ke 
Deductions precisely similar to those above based on the function (43) 
yield the characteristic functions of equation (5). We have first 


Am 


Um(X) = (2 a). 
from which in the manner above we obtain 
x) H(x,m) 


(52) x) = ¢ Pm a) {! 


m m? 


In this Q,(x) is the function occurring also in (52), and 

b-—a ke = 

(536) Qx(2) = E + 
2rt ky 


37. The normalized set. By the classical theory of integral equations 
the set of functions ,,(x), is biorthogonal, that is, 


b 
f Um(x)v,(x)dx = 0, m # p. 


The set of functions is said to be normalized if further 
5 
f Um(X)0m(x)dx = 1. 


Forming the product u(x) vm(x) from formulas (52) and (52) we obtain 


= 1 + 
m m2 


Then since |p,,| is of the order m, while Q2(x) and Q3(x) are possessed of a 
derivative, we obtain as the result of an integration by parts 


H(m) (i 2,3). 


m m? 


(October 
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Accordingly 
H(m) 


m? 


= (b—a)+ 


We may therefore normalize the solutions given by (52) and (52) by multi- 
plying them by a suitable factor of the form 
1 H(m) 


Vb—a m? 


Thus we obtain the following theorem. 


THEOREM 8. If the kernel of the normalized integral equation (5) satisfies 
the conditions (iv), (v), (vi), and (vii), and the kernel of the associated integral 
equation satisfies condition (V), then the normalized characteristic solutions of 
these equations are given respectively, for |m| sufficiently large, by the formulas 


Um(x) = 1 {1 ¥ + Q3(x) 4 


—a m m m? 


Finally we remark that by § 30 and § 33 the functions u(x) and u_m(x), 
as well as the functions v,,(%) and v_»(x), are conjugate imaginary. 


(54) 


CHAPTER 10. THE CLOSURE OF THE SET OF CHARACTERISTIC FUNCTIONS 


38. Birkhoff’s theorem. It has been shown in the preceding chapters 
that for |m| sufficiently large the characteristic values of the given integral 
equation are simple, and that the corresponding biorthogonal set of char- 
acteristic functions 4 (x), ¥m(x) may be normalized. We shall now make the 
further assumption: 

(viii) All the characteristic values of the given integral equation are simple. 

Then the entire set of solutions u(x), ¥m(x) may be normalized. We 
wish to show that the set is also closed. To this end we shall employ the 
following theorem which is a generalization of a theorem given by Birkhoff* 
for orthogonal sets. 


* Birkhoff, Proceedings of the National Academy of Sciences, vol. 3 (1917), p. 656. 
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THEOREM. [f w,,(x), Zm(x), m=0, +1, +2,--- , is a closed normalized 
biorthogonal set of functions, and if Wm(x), Zm(x) is a second normalized 
biorthogonal set, then this second set is also closed provided (a) the series 


[wo(x) — wo(x) }Jzo(y) + Do { [wa(x) — wm(x) 
+ [wen(x) — Wm(x) }2-m(y) } 


converges to a function H (x, y) less than 1/2(b—a) in numerical magnitude, and 
(b) the convergence is such that the series multiplied through by an arbitrary 
continuous function f(x) can be integrated term by term with respect to x, and 
yields a series which converges uniformly on any closed sub-interval of a<y <b, 
and of which the sum of any number of terms remains uniformly bounded for 
asysb. 


The proof, except for minor modifications, is that given by Birkhoff. 
Thus if the set w(x), Zm(x) is not closed, there exists a function f(x) +0 
which is continuous on (a, 6) and such that 


[ = 0 forallm. 


In this event if we multiply through the equation of definition for H(x, y) 
by this f(x) and integrate term by term as we may by hypothesis, we find 


f H(x,y)f(x)dx = f f(x) 2)20(y)dz 


+ f(x) [walx)em(y) + wm(x)2-m(y) 

m=1 /a 
where the series on the right hand side converges under the hypothesis to a 
bounded function which is continuous for a<y<b. This function on the 
open interval is precisely f(y). In fact the difference 


b be b 
{ f + f(x) [wm(x)2m(y) + 


is a function g(y) bounded on (a, b), continuous for a<y <b, and such that 


b 
f ¢(y)wily)dy = 0 for alll. 


This latter property is readily drawn from the fact that the series involved 
may be integrated term by term, while the set w,,(y), 2m(y) is normalized 
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biorthogonal. But the set w»(y), zm(y) is closed by hypothesis. Hence we 
infer that g(y)=0, a<y<b, and that the right hand member of the pre- 
ceding equation has the value f(y). That equation may now be written 


b 
f H(x,y)f(x)dx = f(y), 


If the upper bound of |f(x)| on (a, b) is FO we obtain from this since 
|H (x, y)| <1/2(b—a) the relation 


| f(y) | 


But this is impossible since the left hand member either takes on the value 
F or comes arbitrarily near to it. The set wm(x), Zm(x) is therefore closed 
also. 

39. The integro-differential system with an additional parameter. 
We consider now the system 


b 
@ = +f f(a, |, 


(55) 


For »=1 this is simply the system (25) and defines the set of functions 
Um(x,1)=u(x). For 7=0 it is an ordinary differential system and defines 
a set “,,(x,0). For both these values of 7 there exists an associated or adjoint 
system with its corresponding set of solutions v,,(x, 1) or vm(x, 0) which is 
biorthogonal to the respective set above. It is our purpose to show that for 
other suitable values of 7 in the circle C : |n| <1 of the complex 7 plane the 
system (55) also defines a set of functions u,,(x, 7) and that there exists a 
biorthogonal normal set v,,(x, 7). Subsequently it will be shown that the 
repeated application of Birkhofi’s theorem makes it possible to conclude the 
closure of the set #m(x, 1), ¥m(x, 1) from the known closure of the set #m(x, 0), 
Vm(x, 0). 

The system (55) will be equivalent to an integral equation of the form 
(5S) provided the hypotheses of Chapter 4 as applied to system (55) are met. 
The extension of the deductions of Chapter 4 to system (55) is made formally 
by replacing respectively f(a, €), and V(é), by no(x), nf(«, €) and nV(€). 
This is readily found to result in replacing F(x, £) by nF (x, £), and ®(«) by 
P(x, n)=3(1—n)+nP(x). The essential hypothesis to be met is that the 
function be possessed of a reciprocal function E(x, By the 
classical theory of integral equations this reciprocal is a meromorphic function 


+ 
a 
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of n. Hence its poles within the circle C are finite in number and may be 
denoted by 7;. Then for 7 in any closed sub-region of C, call it C’, which 
excludes these points 7;, the function E(x, &, n) is analytic, and by Theo- 
rem 4 the system (55) is equivalent to an integral equation 


b 


The kernel K(x, &, 7), since it satisfies the relation (31) with the formal 
modifications noted above, is readily found to be continuous in x and n, 
and in fact analytic in 7 in the region C’. 

Consider now the associated equation 


b 
(56) = f 


and the deductions of Chapter 3 as applied to it. From the analytic character 
of K(é, x, 7) and its relation to the kernel of (56), the function E(x, £, 7) 
of § 11 is readily found to be analytic in 7. From the classical theory of 
integral equations we draw again the fact that E(x, é, n) possesses a recipro- 
cal F(x, £, ») which is meromorphic in n. If we add to the set of points 7; 
those poles of F(x, £, 7) which lie within C’ and denote by C” any closed 
sub-region of C’ which excludes also these values 7;, it follows that F(x, &, 7) 
exists and is analytic in C’. By Chapter 3 it follows then that the equation 
(56) is also equivalent to an integro-differential system which is the asso- 
ciated system of (55). 

It is essential to observe that in obtaining this result we have not excluded 
from C” either 7=1 or 7=0. The case 7=1 is covered by the hypotheses of 
the preceding chapters. .For 7 =0 the system (55) is equivalent to the integral 
equation 


u(x,0) = 
and the associated equation ; 
o(2,0) = ‘GE, Ode 
is equivalent to the system . 
v'(x,0) + Av(x,0) = 0, 
L(v) = 0. 


The extension of the deductions of Chapters 5, 6, and 7 to the system 
(55) and its associated system may be made now with the formal modifica- 
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tions noted. For the deductions of Chapters 8 and 9 the condition (vi), 
which in the present case takes the form 


= nV(a) + + 7? f | #0, 


must be met. For 7=1 this is assured by previous hypotheses. Hence k, 
and k, as functions of 7 are not identically zero, and there are at most four 
values of » for which either k; or ke will vanish. Let such of these valuesfas 
lie in C’’ be added to the set of points n; and denote by C’” any closed sub- 
region of C” which excludes also these points. This does not exclude n=1, 
and if we suppose, as we may and shall, that 1~0, v0, it does not exclude 
n=0. For 7 in the region C’’’, then, the biorthogonal set of functions 
Um(X, 1), Um(X, n) are for | m| sufficiently large, normalized in the form 


1 z-—a 


+ 
m m ’ 


1 
Um(x,7) = | {1 


Vb—a 


m 


+ 


m m? 


ke(n) 


and the functions Q,(x, ), Qe(x, n), Qs(x, n), H(x, n, m) are found from 
formulas (53), (39), and an analysis of the deductions of Chapters 7, 8 and 9 
to be continuous in x and 7» for 7 in the region C’”. 

It remains to consider the functions of the set #m(x, 7), Ym(x, 7) for small 
index m. Let us denote by D(n, \) the Fredholm determinant for the kernel 
of equation (56). By the classical theory of integral equations this determin- 
ant is analytic in \ and 7 for 7 in the region C’”, and the characteristic values 
of equation (56) are the roots \=X(n) of the equation 


(58) D(n,d) = 0. 
For n=1 the roots of this equation are all simple by the hypothesis (viii), 


1 
6(n) = —— log 


| 

where | 
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and for 7»=0 they may be explicitly determined from system (55) and are 
also found to be simple. It follows* that the equations 


D(n,r) = 0, 


admit of common roots, which will of course be multiple roots of (58), only 
for isolated values of 7. If we add such of these values as lie in C’” to the 
set ni, then for 7 in any closed sub-region of C’’’, call it C'’, which excludes 
also these values, the roots of (58) are all simple and are analytic in 7. 
We observe that by the remarks above 7 =0 and n=1 are not excluded from 
C™. We shall designate by \,,(y) that root of (58) which joins analytically 
with ,,(0). It is easily seen that for large values of |m| this agrees with 
the notation previously adopted in Chapter 8. It follows from the simplicity 
of the values \,.() that the entire biorthogonal set # (x, 7), Um(x, ») may be 
considered normalized. Likewise it is readily found that the functions 
Um(X, 0), Um(X, n) are continuous in x and 7. 

40. A change of variable. The set u(x, 7), Um(x, ) is not directly 
adaptable to the application of Birkhoff’s theorem. For this reason we 
introduce the set w,,(x, 7), Zm(x, n) defined by the relations 


Wm(x,7) Um( x ,n)e~ (2-2) | 
Zm(x,7) = , 


The points 7; in the circle C form a finite set which includes neither 7 =0 
nor 7=1. The constant r>0 may be so chosen, therefore, that of the circles 
drawn with radii r and centers 7; no two have a point in common and none 
includes the point 7=0 or n=1. We shall suppose such circles drawn and 
their interiors removed from the circle C. The remaining region is of the 
type denoted by C"Y and in this region we may draw a curve I of finite length 
connecting 7=0 with »=1. For 7 on the curve I, then, 0(n) is analytic and 
therefore bounded. Inasmuch as w»(x, 7) Zp(x, 7)=Um(X, 7) Up (x, 7), it 
follows that the set w,(x, 7), Zm(*, 9) is also biorthogonal and normal. 
From formulas (57) we obtain readily 


(57a) 


a(x,n) = avi — 


* Forsyth, A. R., Theory of Functions of two Complex Variables, Cambridge University Press, 
1914, pp. 206-209. 


aD(n,d 
an 
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in which the functions Q,, Q2’, Q;’ and H are continuous in x and 7, and 
therefore uniformly continuous for 7 on Tf and x on the interval (a, 5). 

41. The application of Birkhoff’s theorem. From formulas (57a) we 
obtain now 


Wm(x,n) — Wm(x,n) = 
(x,n) (x,7) ; 


where 


60(x) = Q(x,n) — Q(x,7), etc. 
It follows then that 


(59) [wo(x,n) — wo(x,n) + D> [wm(x,n) — wm(x, 7) Jom(y,7) 


m=1 


+ [ w_m( xn) W_m(x,7) 
M-1 


m=—M+1 


2 


4 > b-a — 


6 — @ m=M 


+ 6Q2'(x) = 


> 


b — m=M 
1 5H (x,m) H(y,n,m) 


mom m? 


Let us consider separately each of the series occurring on the right of this 
expression. The first series is a finite sum which is small in numerical value 
for 7 and 7 onI' and |7—7| small. Also since H (x, n, m) is uniformly bounded 
in m, and is uniformly continuous in x and 7 for 7 on I and x on (a, 3), it 
follows that the last series converges uniformly in x and y to a value which 
is numerically small for |7—7| small. 

The second and third series may be written respectively in the forms 


é b—-a — b-—a 
. ¥ 
x—y 
sin 2me(=—*) 
x — 
b—@ mom m 
y-a 
30 Xx) 2 sin 2mr 7—*) 
x —a 
b— 64 m 
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Now series of the type =[(sin mz)/m] are known to converge uniformly save 
in the immediate neighborhood of z=0, +27, +4m,---, where, however, 
the sum of any number of terms remains uniformly bounded.* It follows 
that each of the series in question here converges to a value numerically 
small because of the uniform smallness of the coefficients 5Q,(x), 5Q2’(x) 
for |7»—7| small. Moreover when multiplied through by a continuous 
function of x the series may be integrated term by term with respect to x, 
and will yield in the case of the former a series which converges uniformly 
for a<y <b, and in the case of the latter a series which converges uniformly 
on any closed sub-interval of a<y<b, and of which the sum of any number 
of terms remains bounded for a<y<b. It is evident, therefore, that for a 
properly chosen constant 6>0 and |n—%| <4, n, 7 on I, the convergence of 
the series (59) fulfills the hypotheses of Birkhofi’s theorem. We may con- 
clude, therefore, that the set of functions w»(x, 7), Zm(x, 7) is closed if 
the set w(x, 7), Zm(x, 7) is closed. 
Let the curve I be divided now by points 7” such that 
n =0, |n® <5, 

and let the number of divisions be s, so that 7 =1. For 1 we have the 
set of functions 


1 1 
0) — 2mri 


Vb—a Vb—a 


which is known to be closed. By an application of Birkhoff’s theorem we 
may conclude then that the set w,(x, 7%), 2m(x, 7) is closed for j=1, 
and by successively repeated applications for 7=2, 3,---, 5s. This last 
application proves the closure of the set w»(x, 1), m(x, 1). This, however, 
implies that the relation 


Wa(x,0) = 


ff = ff =0 


can be true for all m only if 


4 


f(x) =0, 

i.e. if f(x)=0. It follows from this, however, that the set u(x), ¥m(x) is 
closed. This is the closure we wished to establish and hence we have the 
following theorem: 


* Cf., e. g., Jackson, D., Rendiconti del Circolo Matematico di Palermo, vol. 32 (1911), 
pp. 257—262, and Bécher, M., Annals of Mathematics, ser. 2, vol. 7 (1906), pp. 110, 111. 
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THEOREM 9. If the kernel of the given integral equation satisfies the condi- 
tions (iv)—(viii), and that of the associated integral equation satisfies condition 
(v), then the set of characteristic solutions Um(x), Um(x) 48 closed. 


CHAPTER 11. THE NON-EXISTENCE OF OTHER CHARACTERISTIC 
VALUES OR FUNCTIONS 


42. The possibilities of omission. The deduction of a solution of equa- 
tion (25a), and hence also the deduction of the characteristic values, was 
based on the methods of Chapter 6. It is conceivable that other methods 
might yield a different form of solution, from which either different charac- 
teristic values or different characteristic functions or both might result. 
We shall consider in turn the following questions: 

(a) May there exist a characteristic value Nanda corresponding solution 
a(x) +0 which are not included in the sets Am and Um(x)? 

(b) May there exist a characteristic function #(x) +0 which corresponds, 
say, to A;, but is not identical with the u;(x) found? 

(c) May the characteristic function «;(x) found correspond also to a 
characteristic value ) not identical with ),? 

Case (a). Suppose the value \ and the function a(x) mentioned under 
(a) above to exist. From the biorthogonality of the solutions of associated 
integral equations it follows that 


b 
= 0 (m=0, +1, +2,---). 
Since the set u(x), Ym(x) is closed, however, this demands #(x) =0. 

Case (b). Suppose a(x) is a solution which corresponds to the value dj. 
Then from the biorthogonality we conclude that 


f Ui(x)0m(x)dx = 0, 
If for m=/ the integral also vanishes we may conclude as above from the 
closure of the set “m(x), ¥m(x), that a(x)=0. If this is not the case we may 
suppose the solution #(x) normalized by multiplication with a suitable 
constant so that 


ff = 1. 


Then, however, 


ftw — ui(x)}om(x)dx (m=0,4+1,---), 
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and from the closure of the set “»(x), Ym(x) it follows that 


ai(x)= u;(x). 


Case (c). Suppose that the function u;(x) corresponds to a characteristic 
value \+d;. We have in this case 


f 


from which, since 


f K(x,8)ui()dt = 0. 


This, however, is impossible since “;(x) is a solution of the given integral 
equation and u;(x) +0. 

It is seen, therefore, that there exist no characteristic values or functions 
other than those determined in Chapters 8 and 9. 


CHAPTER 12. THE EXPANSION OF AN ARBITRARY FUNCTION 


43. The related differential system. The method by which the proper- 
ties of the solutions ~,,(x) for the expansion of an arbitrary function are to be 
deduced is based on a suitable comparison of the set u(x) with the set of 
solutions y,(x) of the differential system 


y'(x) — py(x) = 0, 
kiy(a) — key(b) = 0. 


This system is related to the given integral equation in that the constants 
k, and kz are those which occur also in the characteristic equation (45). 

It is essential, therefore, to have at hand the expansion theorem for a 
system of type (60). In a form which may be found by specialization of 
results obtained by the author* for more general differential systems this 
theorem may be stated as follows: 


(60) 


* Developments associated with a boundary problem not linear in the parameter, these Trans- 
actions, vol. 25 (1923), pp. 155-172. If in the theorem there stated (p. 171) we set n=1, a(x) =1 
and change the dependent variable and the parameter respectively by the substutitions 

1 ke 
u(x) = /b—a hiy(x)e'-2 “hi, log 


1 


the system considered takes the form (60) above. 
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If f(x) is any function which on the interval (a, 5) consists of at most a 
finite number of pieces, each real and continuous and having a continuous 
derivative, then f(x) may be expanded in a series of the form 


+0 b 
f(x) ~ XY  fOzm(t)dt - ym(x), 


where the functions y,,(x) and z,,(x) are respectively the normalized solutions 
of system (60) above and its adjoint system. This series will converge to 
for a<x<b, to (1/2k:) {kif(at+)+hof(b—)} for x=a, 
and to (1/2ke) {kif(at+) +hef(b—)} for x=). 

44. The function g(x, —, »). We consider now the function g(x, &, p) 
defined by the relation 


m=—p 


Its explicit form may be obtained from the solution of system (60), which 
yields 


1 
Ym(X)Sm(E) = ——— , 
b-—a 


and the formulas (54). It is found in this way that for |m| sufficiently large 


b-—a m m 


m? m 
With the use, now, of the notation of Chapter 10, i.e. 


Pm = + 6, 


and with the choice of a number M fixed and sufficiently large, we obtain 
the formula 


(62) o(x,,p) = o(x,&,M) + Do vdx,é,p), 


1 ky 
@ = —— log 
b-a ke 
4 


x 
sin 


2% P 


m=M+1 


sin 2me( *) 
— b— 


b—a m=M+1 
2% 


A(x,m)H(é, 
sin 2mx( ) 
= 
m=M+1 m 


m=M+1 m?* 


Now sums of the type =[(sin mz)/m] as already noted remain uniformly 
bounded for all values of z. Since this is obviously true also of a sum 
2[H (z,m)/m?] we may conclude that 


(64) | o(x,£,p) | < A (aconstant ), 


for all x and é on (a, db) and all p. 
Let a and 6 be chosen now as any two points of the interval (a, b). We 
have then from (62) for p>M 


8 4 8 
(65) f o(x,t,p)dt = f ola, ff 


It is our purpose to consider the convergence of this integral as poo. 
The first term on the right is free from p and is clearly continuous over the 
interval (a, b). It is likewise clear that the integral involving yw; will con- 
verge uniformly to a function which is continuous on (a, 6). For the consider- 
ation of the remaining terms we recall the fact, also noted previously, that 
sums of the type =[(sin mz)/m] converge for all values of z and do so uni- 
formly except in the immediate neighborhood of z=0, +27,---. It follows 
because of the integration involved that the integrals containing y, and yz 
converge uniformly over the entire interval, and hence further because of 
the continuity of the individual terms that the limiting functions are con- 
tinuous over (a, b). 

The integral containing y is exceptional. The trigonometric sum in- 
volved remains in this case unaffected by the integration, and while this 
sum converges for all x it does so uniformly only over an interval which 
does not extend to x=a, or x=b. Hence the integral converges uniformly 


634 R. E. LANGER [October 
where 
(63) 
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for a<a,<x<b,<b, and the limiting function is continuous only on the 
open interval a<x<b. We may write, then, upon collecting our results, 


uniformly for 


B 
(66) lim } o(x,&,p)d§ = ¥(x) 


where ®(x) is continuous for a<x<b. 
45. The evaluation of P(x). Let f(x) now be the function defined as 


follows: 
1forasx<8, 


f(z) = x<aandB<zxZb. 


This function satisfies the conditions of the theorem in § 43 for expansibility 
in terms of the solutions y,,(x), whence we have 


+0 b 
DL -ym(x) = f(x), 


where f(x) is bounded and differs from f(x) only in the points a, 8, a and b. 
Inasmuch as 


b 8 
we obtain with the use of (61) and (66) the result 
+00 b 
B(x) = um(x) — fix). 


Multiplying this equality by 2;(x) and integrating* it with respect to x 
we find, since the set “m(x), Y¥m(x) is normalized biorthogonal, that 


b 
f = f f = 0. 


Since this result may be derived for all values of /, whereas the set u(x), 
Vm(X) is closed, it follows that (x) =0, a<x<b. The relation (66) now takes 
the form 

uniformly for 


B 
67 
(67) im | o(x,£,p)dt asxs bhi, 


Pro Va 


and this relation holds for all choices of the limits a and 6 on (a, b). 


* From the character of the series as shown by the asymptotic forms of té4m(x), 2m(x) this in- 
tegration term by term is seen to be permissible. 
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The properties (64) and (67), which have thus been established for the 
function ¢(x, £, p), are the hypotheses of a theorem by Lebesgue* in accord- 
ance with which it follows that if f(«) is any function which is summable 
over the interval (a, b) then 
uniformly for 
h. 


(68) lim f = 0 


We defer the complete formulation of this result to the end of the chapter. 

46. The evaluation of (x) at the end points of (a, b). It was observed 
in the derivation of (66) from (65) that the final term of (65) alone fails to 
converge to a function which is continuous over the entire closed interval 
(a, 6). Accordingly we have 


z=b 8 z=b 
=| tim f vat, | 
z= b— prova z=b— 


z=b 
sin 2me(= *) 


b— aa m=M+1 m z=b— 


In virtue of the discussion just completed, however, ®(b—) =0. On the other 
hand the series on the right is, except for a finite number of continuous terms, 
the Fourier expansion of the function 7(}—(«x—a)/(x—b)). The value of 
the bracket on the right is, therefore, readily found to be 7/2. Substituting, 
then, for Q3(£) its value as given by (53b), and for e?°— its value ki/k2, we 
obtain the relation 


1 B_ 
(0) = (vhs + f 


which reduces further because of (47) and (44b) to 


B 
(69a) = f 


The function 2.(£) involved here is that which occurs also in the char- 
acteristic equation (45). It is found in an entirely similar fashion that 


8 
(69b) = f 


* Annales de la Faculté des Sciences de Toulouse, ser. 3, vol. 1 (1909), p. 52 and p. 68. 


¥ 
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Let the functions ¢,(£, p), 7=1, 2, be defined now by the relations 


It follows immediately from the bounded character of 2:(£) and @.(), and 
from the relations (64), (66) and (69) that 


| eil&,p)| < As (i = 1,2) 


for all — on (a, b) and all p, and further that 


8B 
lim = 0 
Pro Va 
for all choices of a and # on (a, 6). By Lebesgue’s theorem, then, we may 
conclude that 


6 
(70) lim | f(é)ei(t,p)dt = 0, 
Pro Va 
for every function f(x) which is summable over (a, b). 
47. The expansion theorems. If we observe now that 


b b b 
a m=—pV/a m=—pVJ/a 
we may formulate of the results embodied in the relations (67) and (69) 
as follows. 


THEOREM 10. If f(x) is any function which is summable over the interval 
(a, b), and if F1,(x) and F2,(x) are the corresponding sums 


Pp b 
F.,(2)= > f tem (2) 


Pp b 


where Um(X), Um(X) are respectively the normalized solutions of the given integral 
equation and its associated equation, and Vm(X), Zm(x) are respectively the 
normalized solutions of the related differential system (given by (60) above) and 
its adjoint system, then 


4 
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uniformly for 
lim [Fip(x) — F 


lim [F1(a) — = f 


b 
lim [Fip(0) — = ff 
a 
The functions 2,() and Q2(&) are determined by the integral equation and are 
independent of the function f(x). 


A more explicit but less general formulation becomes possible if we 
utilize the known expansion theorem for the related differential system as 
given in § 43. Thus we may state 


THEOREM 11. Jf f(x) is any function which on the interval (a, b) consists 
of at most a finite number of pieces, each real and continuous and having a con- 
tinuous derivative, then f(x) may be expanded in a series of the form 


+00 


This series converges to 


{f(x+)+f(x-)} fora<x<b; 
to 


1 
+) + —)} + forza; 


1 6 


APPENDIX. AN EXAMPLE 


It is of interest to apply the preceding theory to a specific example par- 
ticularly because of the fact that the theory demands in Chapters 8 and 11 
that the ratio k:/k2 and the functions 2,(x) and Q2.(x) be independent of the 
parameters wu and v. We consider therefore the integral equation 


b 
u(x) = f 


4+ 2 for x &, 


K(x,t) = 
for x < &. 


638 
to 

where 
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The equation is in normal form, and computing the various functions by the 
respective formulas we find readily 
A = p? + (b* + 3)uv + 2r?, 
W(x) = + + 1), 
V(t) =u (b's? + 2), 
2(u + 2») 
E(x,£) [(u + v) x? vb? JE, 
_ (ue +») [2 — bu + (4 + 26%) 
2A 


D 


— E(x,) 
— 8(4 + 2v) xt 
2— bu + (4+ 
8b2x 
(2 — + (4+ 
b 2b*[2u + (4 + 
(2 by + (4+ 26% 
(2 + + ») 
ki = 
D 
_ (2 ») 
The ratio k,/ke is thus free from yp, v, having the value 
ky _ 2(2 + 5*) 


F(x,&) = 


= ( 


= 


ke 


Further we find 
— 2b?x(u + v) 


b 
V(x) + f x)dt = 


whence 

2Q;(x) 
2+ 
— 


These values are thus also free from yw and vy as they should be. 


Q2(x) = 


DartmoutTH COLLEGE, 
Hanover, N. H. 


i 
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THE FIGURATRIX IN THE CALCULUS OF 
VARIATIONS* 


BY 
PAUL R. RIDER 


Introduction. In the study of a definite integral of the form 


ts 
(1) 

| 
Hadamard defines the figurative of the point (x, y) as the curve F(x’, y’) =1, 
where x’ and y’ are the current coérdinates, and x and y are considered 
constant.f The polar reciprocal of the figurative with respect to the unit 
circle x”+y’=1 is termed by Hadamard the figuratrix.{ The importance 
of the figuratrix and its space analogue seems to have been overlooked, 
and it is the desire of the writer to call attention to a number of their in- 
teresting properties which enable one to interpret geometrically some of 
the well known functions and theorems of the calculus of variations. 

1. Figurative, figuratrix, and indicatrix. We shall prefer to consider 

an integral of the form 


where 7 =arctan(y’/x’). On account of the homogeneity conditions which 
must be satisfied by the function F in (1),§ we have 


(3) F(x,y,x’,y’) = F(x,y,cos 7, sin r)(x’? + y’2)!/2 


and the integral (1) can always be reduced to the form (2). In (2), however, 
the function f(x, y, 7) need not be periodic in 7; the right member of equa- 
tion (3) must be. Moreover, the homogeneity conditions mentioned above 
do not enter into the consideration of the function f(x, y, 7).|| 


* Presented to the Society, December 26, 1924; received by the editors in December, 1924. 
t Hadamard, Legons sur le Calcul des Variations, vol. 1, p. 91. 


t Hadamard, loc. cit., p. 92. 
§ F must be positive homogeneous of degree one in x’ and y’. See Bolza, Vorlesungen iiber 


Variationsrechnung, p. 194. 
|| See Bliss, A new form of the simplest problem of the calculus of variations, these Transactions, 


vol. 8 (1907), p. 406. This paper will be referred to as Bliss (I). 
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Let P, of coérdinates x and y, be a fixed point on an extremal* for the 
integral (2). Take P as an origin, p and g as rectangular coérdinates, and 
consider the envelope of the variable line p cos r+g sin r=f(r). It is readily 
found that the parametric equations of this envelope are 


(4) g=fsinr +f, cosr, 
functions which play an important rdéle in the study of an integral of the 
form (2). (Subscripts denote partial differentiation.) It is thus easy to 
identify this envelope with the figuratrix as defined by Hadamard.t For 
the rectangular codrdinates of a point on the figuratrix are F,-(x’, y’), 
F(x’, y’). But as a consequence of the homogeneity conditions satisfied 
by F we havet 
(S) a(x’, y’) + y'Fy(x',y') = F(x’,9’) = F( sin r)(x’? + 
or 
(6) F.z( cost, sinr) cost +F,y(cos7, sinr) sint = F(cosr, sinr) = f(r). 
Differentiating, we get 
(7) f(r) = —F,(cosr, sinr) sinr + Fy(cos7, sin rt) cos r. 
Solution of (6) and (7) for F, and Fy gives 
(8) Fy, = fcost — f,sinr, Fy = fsinr + f, cost. 
The arguments of the functions F,, and F, may be either (x’, y’) or (cos 7, 
sin rT), since these functions are homogeneous of degree zero with respect 
to x’ and y’. A comparison of (4) and (8) shows that the envelope of the line 
p cos t+q sin r=f(r) is Hadamard’s figuratrix. 
If we define the polar figurative as the locus 
(9) p = f(z), 


it is seen that the polar figurative is the pedal of the figuratrix with respect 
to the origin P. 
The indicatrix§ is the curve 


(10) 


* See Bolza, loc. cit., pp. 32, 203. 

¢ It may be remarked that the rectangular equation of the figuratrix is of the form A(zx,y, 
?,q) = 0, where H is a function satisfying the canonical form of Euler’s equations in the calculus of 
variations. See Hadamard, loc. cit., pp. 151-153. 

t See Bolza, loc. cit., pp. 194-197. 

§ See Bolza, loc. cit., p. 247. 


1 
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Since f(r) =F (cos 7, sin r) =F (x’, equation (10) is equiva- 
lent to the equation F(x’, y’)=1. That is, the indicatrix is the same as the 
(rectangular) figurative. 

It also follows that 


The polar figurative and the indicatrix are inverse curves* with respect to 
the circle p’+q°=1; 

The figuratrix and the indicatrix are reciprocal polarst with respect to 
the circle =1. 


In Fig. 1, P is the origin (a point on an extremal), Q is a point on the 
figuratrix, NV is a point on the polar figurative, NQ is tangent to the figuratrix. 
; Since the figuratrix and the indicatrix are reciprocal polars, the polar 
of Q (call it MR) is tangent to the indicatrix at M. This tangent line MR, 
however, gives the direction which is transverse to the direction PN. 


fsin T+ COST 


Transverse 
direction 


P fcost/-f,sint 


Fic. 1 Fic. 2 


Thus, the direction 7 which is transverse to r, or PN, is the direciion PT which 
is perpendicular to PQ. 


This also follows from the formula$ which gives the direction 7 trans- 
verse to 7, viz.: 


(11) fcos(# — 7) + f,sin(7 — = 0. 


For a reference to Fig. 1 or Fig. 2 shows that the left member of (11) is 
simply the projection of PQ upon the line whose direction is 7. When 


* See Williamson, Differential Calculus, 1899, p. 225. 
t See Williamson, loc. cit., p. 228. 

t See Bolza, loc. cit., p. 304. 

§ See Bliss (I), p. 413. 


Radius of 
curvature S 
Q 
\ 
R fe 
| 
p+q 
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this projection is zero the line having the direction 7 is perpendicular to PQ. 

The normal to the polar figurative is the line joining the point N with the 
middle potnt of the line PQ.* 

2. Geometric interpretations. It can be shown without difficulty 
that PQ=(f?+/?)"*. Consequently NQ=f,, since PN=f. The radius 
of curvature of the figuratrix at Q is easily computed to be f+f,+, which is the 
important f,-function for an integral of type (2). 

The Weierstrass e-function for the integral (2) is 


(12) e(x,,7,7) = f(r) — [f(r) cos (F — r) + f(r) sin (F — 7)]. 


If, in Fig. 2, Q is a point on the figuratrix, it is seen immediately that the 
term in square brackets in (12) is PS, the projection of PQ upon the line 
making an angle 7 with the polar axis. If PN =f(r), the e-function is the 
line-segment SN. If the integral (2) is to be minimized, we must have 
e(x, y, tT, 7) 20 for the set of values (x, y, 7) giving the point P and the 
direction of the extremal at that point, and for all values of 7.— Conse- 
sequently, the projection of PQ upon the line r=7 cannot be greater than f(r). 

If d is the distance from the tangent line to the figuratrix at Q(r), i. e. 
p cos r+q sin r—f(r) =0, to the point Q(7) on the curve, then 


d = [f(7) cos? — f,(7) cos + [f(~) sin? + f,(7) cos 7] — f(r) 


= f(t) cos (F — r) — sin — 7) — f(r). 
Expanding in powers of (r—7) we find that 
(13) d= + +h 
=— 


A similar expansion gives 


e(r,7) = f(z) — [f(r) cos (F — r) + f(r) sin (F — 7)] 
2 (7 7)*[f(r) + ] +j 
=3(7 
Thus 
(14) e(r,7)= —d+k. 


* See Goursat-Hedrick, Mathematical Analysis, vol. 1, p. 69. 
t See Bliss (I), p. 412. 
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The quantities h, 7, k are homogeneous of degree three in (tT —7) and contain 
partial derivatives of f with respect to 7 to an order not exceeding the fourth. 
If these derivatives are finite, d and e are infinitesimals of the second order 
with respect to (FT—7). The geometric significance of e is apparent from (14). 
The function e(r, 7) is the distance from the point Q(7) on the figuratrix to 
the tangent line at the point Q(r), except for an infinitesimal of at least the third 
order with respect to (F-—7r). Under the assumption /,;=0,* this distance is 
positivet for points Q near Q, and consequently the figuratrix is concave 
toward the origin P at the point Q(r). 

A well known theorem due to Hedrick states that all integrand functions 
for which the transversality condition is equivalent to orthogonality are 
of the form F(x, y, x’, y’)=G(x, y) («”+y”")"*. In §1 we found that 
the direction transverse to PN(r) in Fig. 1 is PT(7), which is perpen- 
dicular to PQ. But if transversality is to be orthogonality, that is, if PT 
is to be perpendicular to PN, then PN and PQ must coincide. This of 
course means that the figuratrix, which it will be remembered is the en- 
velope of the variable line NQ, must coincide with the polar figurative 
(locus of NV) and that both curves must be circles. Therefore the equation 
of the polar figurative must be independent of 7, which means that the 
function f(x, y,7) must be of the form g(x, y). Thus we have a simple 
proof of Hedrick’s theorem. 

We shall now investigate the case of discontinuous solutions. Let 
P,PP, be an extremal having a corner at the point P. Let 7 be the direc- 
tion of P,P at P, and 7 the direction of PP; at P. Then at P the following 
relations, called corner conditions, must be satisfied :§ 


f(r) cost — f,(r) = f(7) cost — f,(7)sin7, 
f(r) sin + f(r) cos = f(r) + f,(7)cos7. 


If we refer to (4), the parametric equations for the figuratrix, we see that 
the above corner conditions simply state that for the corner P the directions 
t and 7 give the same point Q on the figuratrix. This is to be expected, since 
corresponding to a corner point on the extremal we have a double tangent 
to the indicatrix, || and the figuratrix is the reciprocal polar of the indicatrix. 


* The function f; must be 2 0 if the integral in (2) is to be minimized. See Bliss (I), p. 409. 

+ Since it is the negative of d. 

t See Bolza, loc. cit., chapter 8. 

§ See Rider, A note on discontinuous solutions in the calculus of variations, Bulletin of the 
American Mathematical Society, vol. 23 (1917), p. 239. 

|| See Bolza, loc. cit., p. 369. 
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It follows that the directions which are transverse to rt and 7 at the corner P 
coincide,* since the transverse direction is obtained by drawing a perpendi- 
cular to PQ. 

Bliss has defined generalized angle{ and generalized areat as 

f@) 
respectively, in which 7 is the direction transverse to r. It may be worth 
mentioning that if in Fig. 1 we mark off on PT a length PN equal to f(7), 
then the integrand of the second integral in (15) is the area of the rectangle whose 
sides are PQ and PN, and the integrand of the first integral is the ratio of 
the area of this rectangle to the area of the square whose side is PN. 

3. The figuratrix in space. For an integral of the form 


te 
(16) f F(x,y,2,x',y',2')dt, 
t 


1 
in which x, y, z are functions of #, the figurative of the point (x, y, z) is 
defined§ as the surface F(x’, y’,z’)=1. The figuratrix is defined as the 
polar reciprocal of this surface with respect to the unit sphere «”+y"+2” =1. 
We shall, however, deal with the integral || 


(17) f fla,y,2,7,0)(a!? + + Pat, 


where 7 and a are defined by the equations 
+ + 2/2)1/2 = cost cose, 
(18) + + 2/2)1/2 = sins cosa, 
+ + = sing. 


The geometric significance of 7 and a is at once evident. If at any point 
of a space curve the positive tangent is constructed, then 7 is the angle 
which this tangent line makes with its own projection in the xy-plane, 
and a is the angle which this projection makes with the x-axis. The figuratrix 
will be defined as the envelope of the two-parameter family of planes 


(19) sing = f(7,¢0), 


* Cf. Bolza, loc. cit., p. 369. 

{ Bliss, A generalization of the notion of angle, these Transactions, vol. 7 (1906), pp. 184-196. 

t Bliss, Generalizations of geodesic curvature and a theorem of Gauss concerning geodesic triangles, 
American Journal of Mathematics, vol. 37 (1915), pp. 1-18. 

§ See Hadamard, loc. cit., p. 96. 

|| Cf. Rider, The space problem of the calculus of variations in terms of angle, American Journal 
of Mathematics, vol. 39 (1917), pp. 241-256. This paper will be referred to as Rider (II). 
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?, 9, 7 being rectangular coérdinates with respect to an origin P on an ex- 


tremal. 
As before, this envelope can be proved identical with Hadamard’s 


figuratrix. In this section we shall develop some of its properties, which 
are perhaps even more interesting than those of the figuratrix in two di- 
mensions. Where the generalization from two to three dimensions is quite 
obvious, proofs will be omitted. 

By differentiating equation (19) partially with respect to r and o we 
obtain the equations 


— psinr cose + qcosrcose = f,(r,0), 


(20) 
— pceosrsine — gsinrsine + rcosoe = f,(r,c). 


Solving equations (19) and (20) for ~, g, r (this can be done if cos +0), 
we get* 


p = fcostrcose — f,sinr/ cosa — f, cos7sing, 


(21) q = fsinr cosa + f,cost/ cosa — sing, 


the rectangular codrdinates of a point Q on the figuratrix. It is readily 
verified that PO =(p?+@?+7°)"? 


Fic. 3 
Let PN =f(r,0), PM=1/f(r,c). (See Fig. 3.) Draw NU =f, making 
an angle of 7/2 with PN and lying in the plane of PN and the r-axis. Then 
* Cf. Rider (II), p. 243, equations (5). 
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the line-segment UQ will be parallel to the pg-plane, perpendicular to NU, 
and equal in length to f,/cos¢. Moreover 


The polar figurative is the pedal surface of the figuratrix with respect to 
the origin P; 

The indicatrix is identical with the (rectangular) figurative; 

The polar figurative and the indicatrix are inverse surfaces with respect 
to the sphere p’+q’+r’=1; 

The figuratrix and the indicatrix are reciprocal polars with respect to the 
sphere 

The plane NUQ is of course tangent at Q to the figuratrix ; the plane through 
M perpendicular to PQ is tangent to the indicatrix at M; 

The normal to the polar figurative is the line joining the point N with the 
middle point of the line PQ. 


The direction PT (7, ) which is transverse to PN(r,¢) is given by the 
equation* 


f( cos r cos cos 7 cos + sin cos sin cos + sino sin 


+ fr sin 7 cos ¢ — cos T cos 


cos 


— f.( cos r sin cos cos@ + sin sino sin 7 cos — cosa sing) = 0. 


Since the direction cosines of PT (7, are cos 7 cos @, sin cos G, sin G, 
and those of PQ are proportional to #, g, r of (21), it is seen that the direc- 
tion PT (rt, &) which is transverse to the direction PN(r,o) is perpendicular 
to PQ. 

It follows at once that if transversality means orthogonality, the function 
f(x, y, 2, 7,0) must be of the form g(x, y, 2). 

The parameters of the surface (21) are r and o and the fundamental 
quantities of the first order are 


E=p?+@t+7, get te; 


The functions 9;, Po, Yo, % are the quantities —a,, —a,, —a; 
—be, —b3, respectively, found in an earlier paper by the writer,} and it is 
not difficult to show that 


* Cf. Rider (II), p. 250. 
t Rider (II), pp. 243-244. 
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E = (f, tang + fre)*/ cos? a + (f + fos)”, 

F = (f, tana + fre) [(f + foo) + (f — fe tan + f,,/ cos? o)], 
G = (f, tana + f,.)? + cos? o(f — f, tana + f,,/ 

H = (EG — F)'? = + f,/ cose, 


where 
fi = (f cos? — f, sina cosa + fir) (f + fas) — (fr + fre)?. 


If 


H H peo H \ pe 


then X=cosr cosa, Y=sinr cosa, Z=sing. This result is to be ex- 
pected ; for X, Y, Z are the direction cosines of the normal to the figuratrix, 
and the plane (19) is tangent to the figuratrix. The fundamental coefficients 
of the second order for the figuratrix are 
D= X + VQrr + Zr, = + Y,q: + 
= — (fcos*o — f, sina cosa + f,,), 
= X pre + + Lire + + 
(Xp, + + (f- tane + fee) 
= X poo + + = — (Xepe + + Z.%e) 
Ci + fac) 
The total curvature of the figuratrix is (DD’’—D"”)/H* =cos’ a/f;. Since 
for a regular problem /; must be positive,* the figuratrix is a surface of positive 


curvature at Q. 
It can be shown that if the integral (17) is to be minimized or maximized 


the function f; cannot be negative. If we assume f,>0, then D and D” 
must be negative for a minimum and positive for a maximum. f 
The Weierstrass e-function for the integral (17) ist 


e(7,0,7,0,) = f(7,c) 
— [f(r,0,)( cos cos cos 7 cos + sin cos sin7 cos ¢ + sing sina) 


(22) + cos — sin cos7 + cos7 sin 7) 
cos 


+ f.(r,0)( — cosr sino cos7 — sing cose 
+ sina) ]. 
* See Rider (II), p. 249. 


t See Rider (II), p. 249. 
t See Rider (II), p. 247. 
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But PS, the projection of the line PQ upon the line whose direction is 
(7, ©), is equal to 


(23) pcostcos¢ +qsinz7cos¢+rsing, 


in which p, g, r are given by (21). Itis not difficult to show that the ex- 
pression (23) is the same as that in the square brackets in (22). Therefore, 
if PN = {(7, ) is marked off on the line whose direction is given by (7, @), 
the e-function is the line segment SN. If the integral (17) is to be made a 
minimum, we must have e(x, y, 2, 7, ¢, 7, ¢)20 for the set of values 
(x, y, 2, 7,0) giving the point P and the direction of the extremal at that 
point, and for all values of 7 and &.* Consequently, the projection of PQ 
upon the line t=7,0=G cannot be greater than f(T, @). 
The e-function may be written in the formt 


e(r,0,7,0) = — 3 [(F — + — — + (© — | +h. 


Thus,t the e-function is the distance from the point Q(7, @) on the figuratrix 
to the tangent plane at the point Q(r, 0), except for an infinitesimal of at least 
the third order with respect to (t-—r) and (@—-c). 

Since the e-function must be positive for a minimum of the integral (17), 
the figuratrix is concave at Q toward the origin P. 

The statements regarding discontinuous solutions admit of obvious 
generalization from two to three dimensions. 

4. Generalization to n dimensions. For the sake of completeness we 
shall generalize for the case of m dimensions. Let us consider the integral§ 


(24) fs, Thy + + xl?) 


in which the x’s and 7’s are functions of ¢. The quantity 7,1 is the angle 
made by the positive tangent to a curve with its own projection in the 
(n—1)-space (x*1,---+,%n-1), Ta-2 is the angle made by this projection 
with its own projection in the (n—2)-space (x1, +, %n-2), and so on. 
Analytically the 7’s are defined by the equations 


(25) = arctan (x/4:/(a/? + - - - + xf?)1/2) 


* See Rider (II), p. 248. 

T See Rider (II), p. 248. 

t See Eisenhart, Differential Geometry, p. 114. 

§ Cf. Rider, On the problem of the calculus of variations in n dimensions, T6 hoku Mathemati- 
cal Journal, vol. 13 (1918), pp. 165-171. This paper will be referred to as Rider (III). 


650 
from which it is seen that 
(26) xf + + = sin cos (4 = 1, 1) 


(if we make the convention sin t>=1). 

Let us now take as origin a point P(x, - - - , x,) on an extremal for the 
integral (24), and as coérdinates the variables f:,---,p,. We shall de- 
fine the figuratrix of the point P as the envelope of the variable hyperplane 


(27) Pi Sin COS Tg * COS = Tn—1)- 
t=—1 
Differentiating partially with respect to 71, ---,7:-1, we obtain the fol- 


lowing equations: 


— pisin 71 COS T2 COS + COS COS = fr,, 


— COS 71 SiN T2 COS Tz COS Ta_1 — P2 SiN SIN Tz COS Tz COS 


+ p3 COS T3+ ++ COS Tr-1 = fr, 


(28) 


— COS COS SIN — * — SIM SIN Tp-1 
+ Pn COS = 
If cos cos? 73 - - T,_1, the determinant of the coefficients of the 


system of equations composed of (27) and (28), is different from zero we 
can solve for the f’s obtaining 


COS Ty-1 


pi = f sin Cos COS Ta-1 + 
COS Ty * * * COS 


COS Tj41° * * COS 


the rectangular coérdinates of a point Q on the figuratrix. 
As in the preceding cases, the direction which is transverse to the direction 
(71, - ++, orthogonal to PQ.* 
Since the normal to the figuratrix is parallel to the line PN, the directio 
cosines of this normal are 


(30) = xf - -- + = sin cos 75 COS (4 = 1, +, m). 


* For the transversality condition see Rider (III), pp. 168-169. 


= . . . . . . . . 
(29) - (§=1,---, 9), 
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The fundamental coefficients of the second order for the figuratrix are then 
defined as 


OX ; Op; 


31 Daw = 
( ) OT; OTK 
The partial derivatives occurring in Dz are found to be 


Ox; 
OT; 


= — sin COS COS Tj-1 SIN T; COS Tj41° * * COS G 2%), 


COS COS * COS Tn-1, 


sin 7; SiN COS T; COS 


COS Tj41°* * * COS Tn-1 
n—1 
-> Jr, SiN Te COS COS? 7 + + COS? TRA 
k= j+1 


COS Ti-1 


tan 75 + 


COS Ty * * * COS Tn-1 


sin rz SiN COS T; COS TE-1 


(f,, tan 75 + 


COS Tk41°* * * COS 


sin rz SiN COS Ty * * COS 


(f,;tan + G24), 


ko j+1 COS ° * * COS Tn-1 


COS 


(f cos? 75 COS? 
COS * * COS 


n—1 


Jr, Sin COS COS? + + COS* home 
+ 


sin rz SiN COS T; * * COS TE-1 


bud COS Tk41° * * COS Tn-1 


dp; COS 


(fr, tan 741 + 


dt; COST{*** COS Tp-1 


sin 7% SiN COS * * * COS TE-1 


(f:, tan tr, + G <i- 1). 


COS Tk41° * * COS 


Ox; 
OTi-1 
Ox; 
G <t— 1), 
OT; 

dp; 
dr; 
dp 
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It is found that 
n—1 
Dix = — Uf cos? COS* Tr — Lh; sin 7; COS T; COS? COS*7 4-1 
j=i+1 


+ 
Di; = — (fr, tant; + (7 # j). 


(It is assumed that f cos” 7441 - - - COS? Ta-1=f when i=n—1.) The writer 
has already shown that for a minimum of the integral (24) the quantities Dix 
cannot be positive.* 

Let us define the quantity H by means of the formula 


2 


on > D?, 


t=1 


OTn-1 


where D; is the determinant obtained by omitting the ith row from the 
matrix. Thenf 


D; = ( — 1)*"f:X;/ cos 72 cos? 73 - 
where f/f; is the symmetric determinant 
Du- - - Dis 
fi=(-1)™'! 


Din Dan 
Thus 


H? = f?/( cos r2 73 - - - cos 
The total curvature of the figuratrix is 


Dy Din 


_(-)" 
fi 


(cos cos? - - CoS"~? 
Dan 


* See Rider (III), p. 171. In that paper ag; is the negative of Djj. 
t See Rider, On the f;-function in the calculus of variations, Washington University Studies, 
vol. 5, scientific series no. 2 (Jan., 1918), p. 100. 
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The Weierstrass e-function is the function* 


n 


i=1 


1 n—1 n—1 


i=1 j=i+l 


the p’s being defined by (29). It thus appears, that with proper restrictions 
on the partial derivatives of f which occur in h, the e-function is the distance 
from the point a) on the figuratrix to the tangent hyperplane at Q except for an 
infinitesimal of at least the third order with respect to the quantities (7;—T7;). 
The concavity of the figuratrix toward the origin P follows, since e must be 
positive for a minimum value of the integral (24). 


* See Rider (III), pp. 169-170. 
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ON THE EXISTENCE OF FIELDS IN BOOLEAN 
ALGEBRAS* 
BY 
B. A. BERNSTEIN 
A class K is said to be a field with respect to a pair of operations A, O 
if A, O play in K the same réle which the operations +, X play in the class 
of rational numbers. My aim in this paper is to determine in any boolean 
algebra all pairs of operations expressible in terms of addition, multiplication, 
and negation for which the elements are a field. 
Postulates for fields. The conditions that make a class K a field with 
respect to a pair of operations A, Oaref given by the following seven postu- 
lates 


P,. xAy=vyAx, if x, y, yAx are K-elements. 

P,. (xAy)Az=xA(yAz), if x, y, 2, xAy, yAz, xA(yAz) are K-elements. 

P;. For any two K-elements x, y there is a K-element & such that xAE=y. 
Py. xOy=yOsx, if x, y, yOu are K-elements. 


P;. if x, y, z, yO2, xO(yOz) are K-elements. 


Py. xO(yAz)=(xOy)A(xOz), if x, y, z, yAz, xOy, xOz, (xOy)A(xOz) are 
K-elements. 


P;. For any two K-elements x, y satisfying the conditions xAx¥x, yAy+y, 
there is a K-element 7 such that xO7n=y. 
Conditions on boolean operations imposed by the postulates. In a boolean 
algebra any binary operation expressible in terms of addition, multiplication, 
and negation must be of the form 


Axy + Buy’ + Cx’'y + Dx'y’,§ 


where A, B, C, D are elements of the algebra. Let two boolean operations 
A, O be given by 


* Presented to the Society, San Francisco Section, April 4, 1925; received by the editors in 
December, 1925. 

t Of a pair of field operations, the operation appearing first will be the one possessing the pro- 
perties that “++” has in the algebra of rationals. 

t See E. V. Huntington, these Transactions, vol. 4 (1903), p. 31. 

§ Schréder’s notation will be employed throughout, except that x’ (instead of x) will be used 
for the negative of x. 
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(1) = axy + bry’ + cx’y + dx’y’, 
(2) xOy = pry + + rx'y + sx’y’, 
I proceed to find the conditions imposed by the postulates P,-P; on the 


coefficients of (1) and(2). 
The conditions that (1), (2) satisfy Pi, Ps, Ps, Ps, Ps, Ps are* respectively 


(3) c= 5B, 

(4) a’d + (ad + a’d’)(bc’ + b’c) = 0, 
(5) b=a',d=c’', 

(6) r= 4, 

(7) p's + (ps + p's’)(gr’ + q'r) = 0, 


(8) a’ (pq + rs) + d(p’q’ + 1's’) + (ad + a’d’ + be + b’c')(p’ q+ = 0. 

To find the condition that (1), (2) satisfy P7, note first that the condition 
xAx~x is the same as a’x-+dx’~0. Further, the condition that for given 
x, y there be an element 7 such that xO7=y, is the condition of solvability 
in 7 of the equation 

+ gun’ + rx'n + sx’n'=y, 
i. e. of the equation 
(p’xy + pay’ + + + (q'xy + gay’ + s’x'y + sx’y')n' = 0. 
The condition for the solvability of this equation is 
p’q'xy + paxy’ + + rsx'y’ = 0. 

Hence the condition that (1), (2) satisfy P, is 
(9) p’q’xy + paxy’ + 1's'x'y + rsx'y’ = 0, a’x + dx’ 0, a'y + dy’ 0. 


By (3)-(5), the condition that (1) satisfy P,-P; simultaneously is 


(10) =a’.t 
By (6), (7), the condition that (2) satisfy P,, P; simultaneously is 
(11) r=q, p's =0. 


By (8), (10), (11), the conditions that (1), (2) satisfy P:-Ps simultaneously 
are 


* For (4), (5) see my Operations with respect to which the elements of a boolean algebra form a group, 
these Transactions, vol. 26 (1924), pp. 171-175, and E. Schréder, Vorlesungen iiber die Algebra 
der Logik, vol. 2, part 2, pp. 493, 494. My results were obtained without the knowledge of Schréder’s 
work. Condition (8) I take from Schréder. 

¢ So that all operations (1) with respect to which the elements of a boolean algebra form an 
abelian group, are given by axy + a’xy’ + a’x’y + ax’y’. Compare my paper cited above. 
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b=c=ad =a ,r= 9,09 + + q's + + aq’s’ = 0, 
which reduce to 
(12) b=c=d' =a',r=q =0. 
By (10), if (1) satisfy P,-P; then d =a, and the condition a’x+dx’+0in (9) 


becomes a’x+ax’+0, i. e. x¥a. It is to be observed that (9), (12) are 
sufficient as well as necessary conditions that (1), (2) satisfy P,-P;. Hence 


THEOREM 1. The necessary and sufficient conditions that (1), (2) satisfy 
P,-P; simultaneously are 


(13) xAy axy + a’xy’ + a’x'y + ax’y’, 

(14) xOy = pry + axy’ + ax’y + sx’y’, 

(15) ap’ + a’s = 0, 

(16) a’p’xy + apxy’ + a’s'x'’y + asx'y’ = 0,% ¥a,y a. 

Existence of fields. Conditions (13)-(16) do not, of course, prove that 
fields in boolean algebras exist. To consider existence, take first the algebra 
consisting of the boolean elements 0, 1. Let a=0 in (13)-(16). Then from 
(15), s=0. Letting x=1, y=1 in (16), we get p=1. Hence, the boolean 
elements 0, 1 are a field for the pair of operations 
(17) xy’ + x’y, xy. 

Let a=1 in (13)-(16). Then from (15), p=1. From (16), by letting x =0, 
y=0, we get s=0. So that the boolean elements 0,1 are a field for the pair of 
operations 
(18) sy xy tay 

Since in a two-element boolean algebra all binary operations satisfying 
the condition of closure are of the form (1),* we observe from (13) that 
operations (17), (18) are the only pair of operations for which the boolean ele- 
ments 0, 1 are a field. 

Turning to a boolean algebra of more than two elements, let e be an 
element different from 0, 1. The element a in (13)-(16) must be 0, 1, or an ele- 
ment different from 0, 1. If a=0, then s=0. But by letting x=e’, y=e in 
(16), we get e=0, contrary to hypothesis. We obtain a similar contradiction 
fora=1. Let a be different from 0,1. From (16), making the substitutions 


= 0, 


* See my Complete sets of representations of two-element algebras, Bulletin of the American 
Mathematical Society, vol. 30 (1924), p. 26. This paper also contains a different proof of the 
fact that (17), (18) are field operations for 0, 1. 
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we get respectively 
Hence 


But this contradicts (15). We therefore find that in a boolean algebra having 
more than two elements there exist no operations (1), (2) for which the ele- 
ments are a field. 

To sum up, we have 


THEOREM 2. In a two-element boolean algebra there exist two and only two 
pairs of operations for which the elements are a field, namely 


xy’ + x'y, wy; xy + + 9. 
In a boolean algebra of more than two elements there exist no operations expres- 
sible in terms of addition, multiplication, and negation for which the elements 
are a field. 


UNIVERSITY OF CALIFORNIA, 
BERKELEY, CALIF. 
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ASYMMETRIC DISPLACEMENT OF A VECTOR* 


BY 
JOSEPH MILLER THOMAS? 


1. Introduction. Levi-Civita’s definition of parallel displacement of a 
vector { has been generalized to non-Riemannian geometries by several 
writers§ who have replaced the Christoffel symbols of the second kind by a 
set of quantities T% which are symmetric in j and k; and by Schouten), 
who has omitted the assumption of symmetry. 

The problem of the changes of the connection I’, which preserve the 
paths (or geodesics) has been treated by Weyl, by Eisenhart and by Veblen 
in the symmetric case] and by Friedmann and Schouten** in the asymmetric 
case. Such changes of the connection preserve, in general, only the directions 
of vectors displaced along themselves. In the present paper are treated 
changes of connection which preserve the directions of all displaced vectors 
(§5).t¢ It is readily shown that two distinct symmetric connections can- 
not give rise to the same displaced directions, so that the connections con- 
sidered are necessarily asymmetric in general. 

In § 7 are given some tensors which are independent of the above change 
of connection, and a process, like covariant differentiation, for forming 
tensors of the same nature but of higher rank is indicated. 

In the final section we find necessary and sufficient conditions in order 
that an asymmetric connection may be made symmetric by a change pre- 
serving displaced directions. 

2. General linear displacement of a vector. Consider a vector field £' 
in an 2-dimensional manifold referred to a coédrdinate system x. The vector 


* Presented to the Society, May 2, 1925, received by the editors in June, 1925. 

+ National Research Fellow in Mathematics. 

t T. Levi-Civita, Nozione di parallelismo in una varield qualunque, Rendiconti del Circolo 
Matematico di Palermo, vol. 42 (1917), pp. 173-205. 

§ Cf. H. Weyl, Raum, Zeit, Materie, 4th edition, p. 100; A. S. Eddington, The Mathematical 
Theory of Relativity, p. 213. 

| J. A. Schouten, Uber die verschiedenen Arten der Ubertragung, Mathematische Zeitschrift, 
vol. 13 (1922), pp. 56-81; Der Ricci-Kalkiil, pp. 62-75. 

§ H. Weyl, Géttinger Nachrichten, 1921, p. 99; L. P. Eisenhart, Proceedings of the 
National Academy of Sciences, vol. 8 (1922), p. 233; O. Veblen, ibid., p. 347. 

** Mathematische Zeitschrift, vol. 21 (1924), p. 218. 

tt This problem has been touched upon by H. Friesecke in his paper Vektoriibertragung, Rich- 
tungstibertragung, Metrik, Mathematische Annalen, vol. 94 (1925), p. 101. 
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¢* at a point P of codrdinates x‘ can be thought of as being displaced to a 
nearby point P’ of codrdinates x‘+dx‘ provided there is given a law specify- 
ing the vector at P’ which corresponds to the vector £‘ at P, or, what amounts 
to the same thing, a law specifying the vector 5£* at P’ which is the differ- 
ence between the vector of the field at the point P’ and the displaced vector. 
Adopting the latter alternative,* we say that a vector £‘ suffers a general 
linear dis placement when 


(2.1) = + 


where dé‘ represent the differentials of the functions £'. 

The quantities H‘,, which will be called the components of the linear 
connection, are subjected only to the restriction that the quantities given 
by (2.1) are the components of a vector—a restriction which determines their 
law of transformation. When the coérdinates x are changed by an analytic 
transformation to <, we have 


Hence 


it is necessary and sufficient that 


We write 
(2.3) Hu = Vir + Qu, 


* Although displacement as here defined only applies to a vector belonging to a field, we extend 
the definition to all vectors by stipulating that the changes in the components be given by— H ' ptidx* 
as far as terms of the first order. 


i 
= Hogs — —tidx*. 
Ox’ dx* 
= — dt«+ | ——_ ap —— —— }éidx*. 
axa dxidxt axt 
In order that these equations may reduce to 
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the I'’s denoting the symmetric part of H%, 


(2.4) =} (Hix + Hii), 

and the 2’s the skew-symmetric part, 

(2.5) = (Hn — 

By adding equations (2.2) to those obtained from them by the interchange 


of 7 and k, and dividing by 2, we find the law of transformation for the 
I’’s is the same as for the H’s, namely, 


(2.6) r; 


= 


Ox* dAxidx* Ox? 


From these equations it is readily proved that 


dlogA 
(2.7) = Tas — t+ 
Ox? Ox! 


where 


Ox? 


By subtracting equations of the form (2.2), we prove that Q', are the com- 
ponents of a tensor.* 

3. Fields of vectors generated by displacement and the curvature tensor. 
Let us consider a field of vectors generated by the displacement of a vector. 
The conditions for such a field are 6£‘'=0, and can be written as 


(3.1) dti + = 0, 
or as 
(3.2) = 0. 

Ox* 
The conditions of integrability of these equations are 
(3.3) Z nti = 0, 
where 

(3.4) Zi = + — x. 


Ox! 


* This result is given by Schouten, Ricci-Kalkiil, p. 67. 
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In order that there exist a vector at each point of space which can be ob- 
tained by displacement of a vector which has arbitrary initial components 
at an arbitrary starting point, it is necessary and sufficient that equations 
(3.2) be completely integrable. The conditions of integrability (3.3) of these 
equations will be satisfied identically if, and only if, 

Zin = 0. 

From the way in which these functions Z j; arise it is evident that they 

are the components of a tensor. By substitution from (2.3) we have 

(3.5) = Bir + 
where B‘,, is the ordinary curvature tensor for the I’’s, namely, 


and where 

Q},; being the covariant derivative of 2), formed with respect to the I’s: 
i an a_i 

(3.8) QE = + ar — Val — 


dx! 


4. The paths or geodesics. If we define a path (geodesic) as a curve 
whose tangent vector at any point is obtained by displacement of the 
tangent vector at a nearby point, the differential equations of the paths are 

dxi dx* 


4.1 + —=0 
ds? ds ds 


Only the symmetric part of the coefficients H, enters into these equations.* 
By choosing for 2), an arbitrary tensor, skew-symmetric in j and &, 
we can associate with any geometry of pathsf a displacement of the type 
(2.1). 
5. Changes of linear connection which preserve displaced directions. 
The vector arising from £* by displacement to the point x‘+dx‘ has for its 
components, as far as terms of the first order, 


(5.1) — 
* Schouten, Ricci-Kalkiil, p. 76. 


¢ Cf. O. Veblen and T. Y. Thomas, The geometry of paths, these Transactions, vol. 25 
(1923), p. 551-580, for a general account of the geometry of paths. 


if 

Bi 
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If we consider another set of H’s, say 


(S.2) = Hy — 
then the new vector arising by displacement has for its components 

(5.3) — + antidz. 

In order that the directions of the vectors (5.1) and (5.3) may be the same, 


it is necessary and sufficient that their difference be in the direction of either, 
or that 


(5.4) = 

to within terms of the first order. The conditions can also be written 

Since these equations must hold for arbitrary ¢’, we have 
and x* = 


If we determine A from this last relation by contraction, substitute back and 
equate the coefficients of dx* to zero, we find 


(5.5) én = — 6%. 
n 


It is seen from (2.2) and (5.2) that an isatensor. Hence a3, is a vector, 
which will be denoted by 2g,. In terms of this vector, equations (5.5) be- 
come* 


on = 25,02, 
and from (2.4) and (2.5) the changes in the I’s and ’s are, therefore. 
respectively 
(5.6) — = + 
(5.7) — = Sipe — 


It is to be noted that the skew-symmetric part of the change vanishes only 
if the symmetric part does and vice-versa. 

Conversely, if the connection is changed in accordance with (5.6) and 
(5.7), g; being an arbitrary vector, conditions (5.4) are fulfilled. 

In the associated geometry of paths, there is brought about a change of 
the affine connection I which preserves the paths. Moreover, every 


* Friesecke, loc. cit., p. 106, obtains these conditions. 
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change preserving the paths is of the form (5.6).* Hence a study of dis- 
placements of the form (2.1) is one approach to a projective geometry of 
paths. 

From (5.6) and (5.7) follow 


(5.8) 


n+1 a) 
1 fa a 
From these equations, (5.6), and (5.7) we find that the following expressions 
are independent of ¢; and are therefore invariant under the change of con- 
nection being considered:f 


é 8; a a 
(5.10) Tl 5x = = n+1 aj 
‘ 5; a a 
(5.11) Ln = Qn — 
n—1 n—1 
(5.12) Za= t+ + 
& 
5.13 
i 
(5.14) Lin = Hn — 


= 1%, =0. 


ak 


6. Normal affine connection. The quantities II) defined by (5.10) are 
the components of the projective connection.{ From (5.10) it follows that 


(6.1) 


* Cf. H. Weyl, Géttinger Nachrichten, 1921, p.99. Also L. P. Eisenhart, Proceedings 
of the National Academy of Sciences, vol. 8 (1922), p. 233, and O. Veblen, ibid., p. 347. 

t We shall use German characters to denote tensors which are independent of ¢. 

t These quantities were first employed and named by T. Y. Thomas in the Proceedings 
of the National Academy of Sciences, vol. 11 (1925), pp. 199-203. 


nf 
We note then that from these definitions, it follows that 
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if, and only if, [2,=0. From (2.7) it is seen that this will be an invariant 
property only under transformations of constant jacobian. An affine con- 
nection for which (6.1) is true will be called normal* for the given 
coérdinate system and those arising from it by transformations with con- 
stant jacobian.f 

If in the given coérdinate system we choose the components ¢; equal to 
Is,/(w+1), then from (5.8) it follows that 


(6.2) rs = 0, 
and conversely. Hence we have proved 


THEOREM 1. For any coérdinate system there is a unique normal affine 
connection. 


T. Y. Thomasf states that the components of the projective connection 
“constitute a normalized affine connection.” In the light of the above dis- 
cussion this statement must be interpreted as follows: for each coérdinate 
system there exists an affine connection whose components are equal to the 
corresponding components of the projective connection in the given co- 
érdinate system. The same equality of components also holds in coérdinate 
systems arising from the given one by transformations of constant jacobian. 
In coérdinate systems connected with the given one by transformations of 
variable jacobian, however, a different affine connection will be normalized 
and will have its components equal to those of the projective connection. 

It also follows that the skew-symmetric tensor 


Oxi 


(6.3) 


vanishes for a normal affine connection in a properly chosen coédrdinate 
system, and therefore in any coérdinate system.§ Conversely, we can show 


* E. Cartan, adopting a viewpoint entirely different from that of the present paper, includes 
1'é, = 0 in the definition of his normal projective connection. He proves a result equivalent to 
Theorem 1. Cf. his paper Sur les variétés d connexion projective, Bulletin de la Société Ma- 
thématique de France, vol. 52 (1924), p. 211. 

t T. Y. Thomas (loc. cit.) places A = 1 as a device to make the II’s have a law of transformation 
like that of the T’s. He calls transformations of codrdinates of this type equi-transformations. For 
a determination in finite form of the equations of all such transformations, cf. E. Goursat, Bulletin 
des Sciences Mathématiques, vol. 41 (1917), p. 211. 

t Loc. cit., p. 200. 

§ That it is always possible to choose the affine connection so that the skew-symmetric tensor 
vanishes was first established by L. P. Eisenhart, Proceedings of the National Academy of 
Sciences, vol. 8 (1922), p. 233. 
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that if the skew-symmetric tensor vanishes, then it is possible to choose the 
coérdinate system so that ['¢,=0. We have by hypothesis 


ar, 
Ox? 


Hence the equations 
0 log A 


are completely integrable, and it is possible to determine a A satisfying them 
and assuming an initial value different from 0. Suppose that such a solution 
is 


(6.4) A = f(x',x?,-+-, x"). 
Choose the variables (#, Z*, --- , £") as arbitrary analytic functions of 
(x1, x2, - + -, 2") subject only to the restriction that the minor of 0Z!/dx! in 


A shall initially be different from zero. Then (6.4) can be solved for this 
derivative and Cauchy’s theorem applied to show the existence of functions 
#! satisfying (6.4). In the codrdinate system so obtained it follows from (2.7) 
that =0. 


THEOREM 2. A necessary and sufficient condition for the existence of co- 
érdinate systems for which a given affine connection is normal is the vanishing 
of the skew-symmetric tensor. 


Veblen* has pointed out that for affine connections of the type under 
consideration there is a definition of volume: 


V = | ydx'dx*--- dx", 


In the codrdinate systems referred to in Theorem 2 this definition takes 
the form 


y being a constant. In a Riemann geometry, the skew-symmetric tensor is 
zero. Hence coérdinate systems exist for which the affine connection given 
by the Christoffel symbols is normal. Since in this case 


log Vg 


* O. Veblen, Proceedings of the National Academy of Sciences, vol. 9 (1923), p. 3. 
Cf. also L. P. Eisenhart, ibid., p. 4. 


Ht 
i 
“4 

f 

| 

= 
logy 
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where g is the determinant of the fundamental tensor g;;, they are the 
coérdinate systems for which g is constant. 

It is likewise seen from (5.9) that by choosing ¢; properly we can make 
= 0 


for the displacement, so that 
(6.5) Liz = Qin. 
These are invariant conditions, but in general they can not be realized simul- 
taneously with (6.1). 
A proper choice of ¢; will in a similar manner make 


(6.6) La= Bp 


for the given coérdinate system. 

7. Tensors independent of the change of the linear connection. By com- 
putation from (3.6) or by reference to Veblen and T. Y. Thomas* we find 
(7.1) BN = BY — — Bu) — + je, 


where 
Pir = + 


and gy, is the covariant derivative of g formed with respect to the I’s. 
By contraction, the following expressions are found: 


n—1 n?—1 


Siz — Sin 
where Ru=Boe is the Ricci tensor and S;; is given by (6.3). Substitution 
from (7.2) in (7.1) and separation of the accented and unaccented terms 
show that the quantities 


i i 6; 
(7.4) Bar = Bar — 


1 é ‘ 
(6,R jx — 5:Rjx) 


1 
(54S jx — S jx) 
n? — 1 


* Loc. cit., p. 559; also L. P. Eisenhart, Annals of Mathematics, ser. 2, vol. 24 (1923), p. 
377, where there are differences of sign due to different definitions of g; and B ‘al 


1926] ASYMMETRIC DISPLACEMENT OF VECTORS 667 


are the components of a tensor which is independent of the vector g. It is 
the projective curvature tensor discovered by Weyl.* It can also be ob- 
tained by expressing integrability conditions of the equations of transforma- 
tion of the components of the projective connection (5.10).t When it is 
obtained in the latter way, it is expressed in terms of the II’s. That it can be 
so expressed, follows directly from the observations leading to (6.1). In 
fact, we can prove 


THEOREM 3. Any projective tensor formed from the components of the affine 
connection can be expressed in terms of the components of the projective con- 
nection. 


By a projective tensor is meant a tensor independent of the vector ¢. 
To prove the theorem, we compute the values of the components of the 
tensor for a normal affine connection. Equations (6.1) show that each T 
can be replaced by the corresponding II. Since the values of the components 
of the tensor are by hypothesis independent of the affine connection for 
which they are computed, the theorem is proved. 

Treating (3.7) in an entirely analogous manner, we get 


Cut — — Dix) + — 
Diz — Dj! 
Oj, — = 
n—1 


Cie — Ci Diu — Di 
n—1 (n 


= 


Cr = Ce 


ika ? 


Dir = - 
It then is found that the following are the components of tensors indepen- 
dent of ¢: 

8D 1 
(7.5) = Ci — + 1 x) 


n—-1 


1 ‘ 
+ — jx), 


* H. Weyl, Géttinger Nachrichten, 1921, p. 99. 
t Cf. a paper by the writer, Proceedings of the National Academy of Sciences, vol. 
11 (1925), p. 207. 
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(3 — — Cry 
(7.6) 
(m — 1)? n—1 
Dix Siz 
m—-1 


(7.7) = 


The tensor given by (7.7) can also be obtained by expressing integrability 
conditions of the equations of transformation of the quantities (5.13). 
We find also 


 kla 
and that the Weyl tensor has the property of cyclic symmetry, namely, 
+ + Bix = 0, 


whereas the tensor Xj; does not. 
We note also that the tensor 


i 
Bint = Zin — 


arises by treating (3.4) in the same way that we did (3.6) and (3.7). From 
a consideration of equations (6.6) we see that this tensor can be expressed in 
terms of the quantities (5.14), and a theorem similar to Theorem 3 can be 
stated for projective tensors formed from the H’s. 

In addition to the above tensors which are independent of ¢, we have also 
the projective invariants given by formulas (5.10) to (5.14). Of these, the 
quantities 2%, present special interest. They constitute an affine (symmetric) 
connection which is uniquely determined by the linear connection and which 
is the same for all linear connections yielding the same displaced directions. 
Since it is readily proved that the ~’s have the same law of transformation 
as the I'’s, they form a basis for covariant differentiation which is indepen- 
dent of g. Thus from the projective tensors given in this section we can form 
infinite sequences of tensors of the same character. To obtain formulas for 
covariant differentiation we need only replace the I’s by 2’s in the ordinary 
formulas.* 

8. Semi-symmetric displacements. We next inquire under what con- 
ditions it is possible to choose ¢ so that 0% =0. A necessary condition is 
obviously the vanishing of the tensor  },, that is, 

i i 


i 0; on 
“ak 


(8.1) Qn = 
n—1 n— 


* Cf. O. Veblen and T. Y. Thomas, loc. cit., p. 571. 
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This condition is also sufficient; for, as remarked above, the vector ¢ can 
be chosen so as to reduce 2) to Qj. We can write (8.1) in the form 


(8.2) Qin = — 


where y; is a vector. This gives the type of asymmetric displacement studied 
by Friedmann and Schouten and called by them a semi-symmetric dis- 
placement (halb-symmetrische Ubertragung).* We can therefore state the 
following theorems. 


THEOREM 4. A necessary and sufficient condition that an asymmetric dis- 
placement be semi-symmetric is the vanishing of the tensor &. 


THEOREM 5. A semi-symmetric displacement can always be replaced by a 
symmetric one with preservation of displaced directions. 


It is seen from (5.12) that for this displacement, reduced to the symmetric 
form, 
‘ 
= Ve. 


We can also arrive at the semi-symmetric displacement in another 
manner. Let the components of the linear connection in a codérdinate system 
y be denoted by } and consider a vector with components 7‘ in that 
system. The vector 7‘ and that obtained from it by displacement, namely 


nt — unidy*, 


will have proportional components in the y coérdinate system, if, and only if, 
(8.3) 


at the given point. The proof of the foregoing statement is entirely analogous 
to the derivation of equations (5.5). Denoting by ~, the expressions 
ay 
a y 


- 
n 


we find from (2.2) 
Ox 


= — +3; 
Oxidx* dy 


*Loc. cit. Cf. also J. A. Schouten, Proceedings, Koninklijke Akademie van Weten- 
schappen te Amsterdam, vol. 26 (1923), p. 850, where certain applications to physics are indi- 
cated. In the latter connection, cf. H. Eyraud, Comptes Rendus, January, 1925, pp. 127-129. 
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whence by interchange of 7 and & and subtraction, 
(8.4) 205, = 


Contraction gives , 
= 
n—-1 ™ 
and substitution in (8.4) shows that (8.1) are satisfied. Hence the displace- 
ment is semi-symmetric. Conversely, if (8.1) are fulfilled and we define a 


system of coérdinates with origin at the point x‘=x9 by the equations 


(8.5) xt = xo — 3 


then by means of equations (2.2) written in the form 


a 


and the relations 


(=) 5; ( ) 
dy}, 


which are consequences of (8.5), we find that 
26 
n—1 
Hence (8.3) are satisfied, and itis seen that the quantities (5.14) vanish at 
the origin in the y coérdinate system. 


(Hn)o = 


THEOREM 6. A necessary and sufficient condition in order that a displace- 
ment be semi-symmetric is that there exist for each point of space a coérdinate 
system in terms of which any vector at the given point and that arising from it by 
displacement to a nearby point have proportional components.* 


It is to be noted that the codrdinate system given by (8.5) is independent 
of the vector ¢g and is the geodesic coérdinate system for the associated sym- 
metric connection. 

I wish to express my thanks to Professors Eisenhart and Veblen, who have 
read the manuscript of this paper and improved it by their suggestions. 

* Weyl, in Raum, Zeit, Materie, p. 100, arrives at the affine connection by postulating the 


existence of a codrdinate system in terms of which original and displaced vectors have equal com- 
ponents. 
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TENSORS DETERMINED BY A HYPERSURFACE 
IN RIEMANN SPACE* 


BY 
HARRY LEVY 


INTRODUCTION 


Bianchit has generalized the study of a surface in euclidean 3-space by 
considering a hypersurface, V,, immersed in an arbitrary enveloping space 
of one more dimension. Associated with such a hypersurface there are two 
differential quadratic forms, which in this paper we denote by gasdu*du® 
and Q.sdudu®, where the u’s (a, B=1,2,---,m) are codrdinates in V,. 
These forms, known as the first and second fundamental forms of the hyper- 
surface respectively, are defined precisely as are the two fundamental forms 
of a surface in ordinary 3-space; the first in the case of a positive definite 
form gives the square of the element of the arc, 

ds* = 

while the second may be obtained by considering the variation of the first 
order of the fundamental tensor of the space as one passes from the given 
hypersurface to a nearby hypersurface geodesically parallel to the given 
one. The variations of higher order, not considered by Bianchi, yield ad- 
ditional tensors which are intimately connected with the study of the given 
hypersurface and of the hypersurfaces geodesically parallel to the given 
one. These tensors can be found by writing the linear element of the en- 
veloping space in the form 


where the curves of parameter w° are geodesics orthogonal to V, and the 
parameter «° is the arc of these geodesics measured from V,. The coefficients 
in the expression of cag as a power series in “° are the required sequence of 
tensors, 


Cap 
The hypersurfaces u°=constant are geodesically parallel and the funda- 
mental tensor of any one of them is given by this power series with u° 


* Presented to the Society, February 28, 1925; received by the editors in February, 1925. 

t Bianchi, Lezioni di Geometria Differenziale, 2d edition, vol. I, or 3d edition, vol. II. Reference 
may also be made to other works that have recently appeared, Der Ricci-Kalkiil, by J. A. Schouten; 
Lezioni di Calcolo Differenziale Assoluto, by T. Levi-Civita; Vorlesungen iiber Differentialgeometrie, Il, 
by W. Blaschke; Riemannian Geometry, by L. P. Eisenhart, Princeton University Press. 
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replaced by the particular value of the constant. Hence these remaining 
terms are essential, in particular, in the study of geodesically parallel 
hypersurfaces. 

We devote the first part of the paper to a brief capitulation of known 
results, and to the proof of two theorems in general tensor analysis which 
we need in the later development. In the second part we obtain the general 
form of this sequence of tensors, expressing it in intrinsic form in terms of 
known functions of the space and hypersurface. In the last part of the 
paper we discuss some subcases, proving several theorems of geometric 


interest. 
The author wishes to express his appreciation of the valuable suggestions 


and helpful criticism given him by Professor Eisenhart. 
Part I 


1. We consider a Riemann space of »+1 dimensions, denoted for 
brevity by V,4:; the fundamental form of this space* 


(1.1) 
is assumed to have a non-vanishing determinant, 
(1.2) a=|a,,| <0. 

We denote by a” the cofactor of a,, in the determinant a divided by a 
itself. We may then write 
(1.3) = 6) (r,t =0,1,2,---, m), 
where 6; is a Kronecker delta, that is, 


— 


r,t=0,1,---, 
t n) 


(1.4) 


Let [rs, t] and {,*,} be the Christoffel symbols of the first and second 
kinds respectively, so that 


(1.5) ~ 


) 
Ox" Ox' 


and 
(1.6) [rst] = 


* As usual, the repetition of an index, once as subscript and once as superscript, indicates a 
summation on that index over all values from 0 to ». However, compare the remarks following 
equations (4.3). 


ft 
1,r = 
x 
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We need also the Riemann symbols of the fourth order; denote those of 
the first kind by R,,., and those of the second kind by R?,,, so that 


ret = rl, 
x p ps 


and 
(1.9) = Rorst a,,R* 


These relations may be written in a number of other ways because of (1.5), 
(1.6) and (1.9).* The properties of the R’s, namely that 

(1.10) R? = — R? 


rst rts? 


(1.11) R?,+ R?, +R? =0, 


trs 


(1.12) R orst Rorts Rypts Biss 


follow quite readily from the definitions. The reader is referred to Bianchi 
for further details. 

Let us denote the covariant derivative of R,,.. with respect to x* and 
the form (1.1) by Rorst,. and similarly let R?.:,. be the covariant derivative 
of R®,, Since the covariant derivatives of a;; and a” vanish,{ it follows 
from (1.9) that 
(1.13) R? a*Ryost,e 5 Roret,e = 


rete pst,e* 
Let us recall also the identity of Bianchift 


(1.14) Roots + + Reuss = 0. 


Successive covariant differentiation of R’,,, and Ror. gives us a 
sequence of tensors which we denote by Ry...» and R,,..4---». We 
find that these R’s satisfy the analogues of (1.13) and the equations ob- 
tained through the differentiation of (1.10), (1.11), (1.12) and (1.14). 


* Bianchi, 2d edition, loc. cit., pp. 72, 73. 

t Ricci and Levi-Civita, Méthodes de calcul différential absolu, Mathematische Annalen, 
vol. 54 (1901), p. 138. 

t Bianchi, loc. cit., p. 351. Also Veblen, Normal coérdinates for the geometry of paths, Proceed- 
ings of the National Academy of Sciences, vol. 8 (1922), p. 197. 
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2. Consider a point function f(x°, x', - - - , x") evaluated along a curve C. 
If we write 


dx’ 

(2.1) v= (r = 0,1,2,---, m), 
ds 

the \’s are the components of the vector tangent to the curve, s being its 


arc ;* the derivative of f along C is given by 
(2.2) 


where f, is the derivative df/dx". The second derivative, 0°f/ds?, can be 
obtained by differentiating the right hand side of (2.2) covariantly with 
respect to x‘ and the form (1.1), multiplying by \, and summing on ?¢. 
We obtain 

(2.3) = fri + fr", 

ds? 
where f,, and \", are the covariant derivatives of f, and A’ respectively. 

If C is a geodesic, and if we denote its arc by u°, we must haveft 


d?x" dz‘ 
(2.4) + { =0 (ry = 0,1,2,---, n). 
ij 


(du°)? du® 


This may be written in the equivalent formf 
(2.5) =0 


where X’,; is the covariant derivative of \’ with respect to x* and the form 
(1.1). In equations (2.3) the second term drops out by virtue of equations 
(2.5), so that we have 


a*f 


= 
(au°)? 


(2.6) 


By induction it follows that 
(2.7) ( 1,2 ) 


* If C is not a mull curve, that is, if, for C, aijdx ‘dx’ 0, the arc is defined, ds?= lajjdx‘dx? |. If 
C is null, s must be interpreted as a parameter along the curve. Cf. Eisenhart, loc. cit. 

t Bianchi, 2d edition, loc. cit., p. 334, or any of the other works on this subject. 

t Cf. Levi-Civita, loc. cit., p. 291, or any of the similar works. 


1926] TENSORS IN RIEMANN SPACE 


Hence we have the following theorem : 

The mth directional derivative of any function f along a geodesic is obtained 
by taking the inner product of the mth covariant derivative of f withX"d" - - - dt@, 
where d° is the vector tangent to the geodesic. 

In particular, it follows that, if —x‘ is taken for the function f we must 
have 

d™x* dx" 
(2.8) 

(du°)™ du® du° 
where the functions I'},...;, are formed from —x‘ by the formal process of 
covariant differentiation, 


(i = 0,1,2,---,#), 


0, 1 = 


-1, 

ar? 

Ox* 
r 

} 

Hence that solution of equations (2.4) which has initial values y‘ and £*, 


du® 


r = 


must be* 


1 
(2.10) xi = yi + — rT 


rs 


where {,‘,} and the I’s are evaluated for the point yi. 
3. We consider an arbitrary tensor 7;;..., expanded in a power series, 
for simplicity, in one variable, 
(0) 


(1, J, k=0, 5, 2, n). 


If we make the transformation of coérdinates defined by the equations 


(3.2) 


* Cf. Veblen, loc. cit., p. 192. 


675 
= 7°, 
| xi(z', #,---, £*) (i=1, 
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the components 7;;...% in the new codérdinate system undergo the usual 
tensor transformations, which in this case, and for a covariant tensor, 


reduce to 
(3.3) 


(a, 8B, y=1, 2,---, m). 


Since x° appears only in T and not in the derivatives, it follows by differen- 
tiation of (3.3) with respect to x° that 


(3.4) 


that is, that T ee are the components of a covariant tensor in x°=0. 


The similar result for a mixed or contravariant tensor or invariant ob- 
viously is valid. If the expansion (3.1) is in a series of several variables, 
the result holds for the variety of the remaining variables. Putting this 
in the form of a theorem, we have 
If Ts;...% and T,;...% are the components of a tensor in the codrdinate 
systems x and & respectively, and if these codrdinates are related by equations 
x* = (a = 0,1,2,---, p—1), 
x* = x4(ZP, ¥*) (u=p,pt+1,---, n), 
then the coefficients in the expansion of T,,..., in a power series in x®, x', 
- ,x?-! are the components of a tensor in the subspace x*=constant 
(a=0,1,--- , p—1), and the components of this tensor in the coérdinates 
Z?,---+ , 2" are the coefficients in the expansion of Ty,...» im @ power series 
Part II 
4. We consider a hypersurface of dimensions denoted by V, im- 
mersed in V,4: with the quadratic form (1.1). Such a spread is defined by 
the equations 


(4.1) xi = (4 = 0,1,2, - 


where the jacobian matrix 


|| 


(4.2) J= | a 


a= 
is of rank m. The fundamental form of V, is given by 


(4.3) = 


4 = Ox* Ox” 
age age aay’ 
--, 
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where 


of 


4.4 


In equations (4.3), a and 8 are summed from 1 to m, and throughout the 
remainder of this paper Greek letters will be reserved for such summations, 
while Latin letters as i and 7 in (4.4) indicate a summation from 0 to 2. 

When the form (1.1) is positive definite, (4.3) will be likewise; but when 
no assumption is made concerning the definiteness of (1.1), it is necessary 
to assume explicitly that the functions f‘ are such that the discriminant g 
of the form (4.3), 


(4.5) g = | gesl, 


is different from zero. 
As usual, g** denotes the cofactor of gas in g divided by g itself, so that 


(4.6) = (a,y =1,2,---, m), 


where 62 is defined by equations (1.4). 

Eisenhart* has shown that the condition g~0 and the condition that 
the vector normal to V, be not null are equivalent; hence if we denote the 
components of this vector by £* they may be chosen so that 


(4.7) = 


where e is plus or minus one. 

The investigations we wish to make will be facilitated if instead of the 
general coérdinate system x°, x!,--- ,2" we make use of a codrdinate 
system w!,---, u", where the «° codrdinate of a point measures’ the 
distance of that point from V, along a geodesic normal to V, and the co- 
ordinates u', u?,---, are any codrdinates in V,. In these codrdinates 
we find that+ 


(4.8) Cu =e, Coa = Can = O (a = 1,2, “++, m), 
where e is defined by equation (4.7), and hence the form (1.1) becomes 


(4.9) g = e(du°)? + cagdu*du'. 


* Eisenhart, loc. cit., p. 144. 
+ Cf. Bianchi, 2d edition, loc. cit., p. 336, when the form (1.1) is assumed positive definite. 
Eisenhart, loc. cit., p. 146, treats the general case. 
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The equation of V, in these coérdinates is u°=0, and equations (4.4) 

become 

(4.10) Bap = Cap(O,ul, ---, (a,B = 1,2,---,m). 

If c‘i is the cofactor of c;; in the determinant c=|c;;| divided by c itself, 

(4.11) = -- (a,8 = 1,2,---, m). 
We observe further that the components in the coérdinates u* of the 

vector normal to V, are given by 

(4.12) t= 0 (a = 1,2, ---,m). 
5. Bianchi considers the variation in the fundamental form of Va4, 


as one passes from V, to a hypersurface geodesically parallel to V,. Letting 
¢ be the constant distance between these two hypersurfaces, he finds that* 


OCap 
(5.1) ig = ( ) a du®, 
and he defines functions 2.3 by the equations 
(5.2) 


uo = 0 


The two quadratic forms that we are thus led to, gagdu*du® and Qasdu*du', 


known respectively as the first and second fundamental forms of the hyper- 
surface, are of prime importance in the study of the variety. But their 
coefficients gas and Q,s are only the first two terms in the expansion of Cas 
in a power series in u°, 


(S.3) Cap = Sap — + + + 


so it is natural to inquire into the nature of the remaining terms. 

That the functions c{% are of an intrinsic nature is shown by the fol- 
lowing theorem: 

The functions cj are tensors in the hypersurface and invariants in the 
space. 

The proof of the first statement is a direct consequence of the theorem 
of § 3. For the second part we observe that if we had, in V,4:, two general 
coérdinate systems x and Z it would follow that 

Ox* Axi ( _ OF Oxi OE" 


due Ox* du* Out due 


* Bianchi, 2d edition, loc. cit., p. 360. 
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and hence 
Ox* dx? 
Cap = 


(a, 8 = 1,2, -- -,m) 


is an invariant in V,,:, and consequently the functions cj are invariants 
in 
6. We devote the remainder of this paper to the determination of these 
tensors and to some applications to particular spaces and hypersurfaces. 
From the definitions of the Christoffel symbols in equations (1.5) and 
(1.6), we have that, for the fundamental form (4.9), 
1 


(6.1) [00,0] = [00,2] = [02,0] =0, [0a,8] = 


and hence that 


If we evaluate Roago from equations (1.7), we are able to solve for the 
second derivative, obtaining that 


OR + 1 ” OCay ( B 1.2 ) 
(au)? — 


and hence, by virtue of (4.11) and (5.2), 


(6.3) 


(2) 


(6.4) Cap = Roago 
where 
(6.5) a = = 


and it is understood in (6.4) that Roago is evaluated on the hypersurface. 
For a general coérdinate system, x°, x',---,", in the space, the third 
fundamental tensor is given by 

(2) Oxi dx* 


(6.6) Cap = 


+ 


where the R’s here are the Riemann tensors with respect to the fundamental 
form of the x’s. 
We can obtain 0°c.g/(du°)* by differentiating equations (6.3), 


0 1 0c 


(6.7) 


(au)? 2 du® 
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The ordinary derivative of Roaso has not an invariantive significance, 
and so it is preferable to replace it by the covariant derivative Roago.o. 
From the definition, simplified by virtue of (6.2), we obtain that 

1 Oa» R 
(6.8) = Roaso,o + 
Furthermore, from 


(6.9) = yg = 


it follows that 


6.10 cer =— 
) ou 


Hence equations (6.3) are equivalent to 
1 OCay 


from which we obtain that 


20u® 


Substituting (6.8), (6.11), and (6.3) in (6.7) and making a slight simplifica- 
tion due to the equalities 


(6.12) Roaso = Rogao; R® = CYRoya0 = C87 Roar; 


0a0 


we obtain that 


To obtain the fourth derivative in intrinsic form, we observe that be- 
cause of equation (6.2) we are able to write 


OC 


ORoas0,0 
0 au? 


Roayo,o 


1 
(6.14) = Roaso,00 + — cH” + 
2 


Furthermore, in the consideration of the term 


OC ua 


cH 


= 1,2, ---,m) 

ou® 
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of equations (6.13), differentiation of the first two members by application 
of equation (6.11) introduces, among others, the term 
OCya 
au® au? Room, 


while the application of formula (6.8) to the differentiation of Roo intro- 
duces 


OC ua OCr, 


Roc go 
because of equations (6.10) these expressions are equal numerically, but 
opposite in sign; the two similar terms obtained in the differentiation of 


0c 
cu Roa 70 


likewise cancel each other. These are the only cancellations occurring, so 
that we have 


(aus = 2Roaso,o0 + Rouso 


OCup 
(6.15) + ( Rovgo,0 + 


See 


Xo 


Similarly, we obtain 0°c.s/(0u°)® by differentiation of (6.15), but instead, 
let us turn to the general derivative to evaluate it. 

7. For the consideration of the general coefficient it will be desirable 
to introduce a slightly more convenient notation, in part necessary for the 
sake of clearness. We can see that the covariant derivatives of Roaso with 
respect to «° and of all orders will appear in the following pages and we shall 
denote by Roaso,tn and Roo, the /th covariant derivative with respect to 
u° of Roaso and Rojo respectively. Thus, 


a a 
Roaso, = Koaso,o00; Rogo,t41 = Rogo,oo00- 


For symmetry in our formulas, we shall interpret Roago, to) as merely Roago, 
that is, 


a a 
Roaso,to) = Roago, Rogo,to) = Rogo. 
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With this notation we are able to write the general formula suggested 

by equations (6.8) and (6.14), namely 

0 1 0c va Oc »B 
This follows immediately from the definition of covariant differentiation 
when the Christoffel symbols are subject to equations (6.2). Similarly 
we have for the covariant derivative of the four-index symbol of Riemann 
of the second kind, that 

(7.2) Rogo, + Royo, — Rogo,(n- 

In the following discussion there will arise contracted products o1 
several R’s. We introduce the notation Ri,,-.- kyjes, defining it by the 
equation 
(7.3) kylas = Roy20, Rosso, tea) Ron” 10, [ky-1) Rows 60, tkyl 
We note that NV may take on any value 2, 3, - --, while the k’s may take 
on 0,1, 2,---. For further clarity let us observe that 

08 
R210} «8 


From the definition, from equations (6.12) and from the equations de- 
rived from (6.12) by covariant differentiation, it follows that 


(7.4) Rikrks...kylas = kikyky—1] 


If we differentiate equations (7.3) covariantly with respect to u°, we 
find that 


N 
t=1 


Again, if we evaluate (0/0u°)Rix,.--%yias by means of equations (7.1) 
and (7.2) we find that the term Roj,,,0,(., of R contributes the single term 
Ros,,,9. 0% +0, for the other two terms which arise from the derivative of 
Roj,,,9. 4) are cancelled by two similar terms one of which arises from the 
differentiation of Ro;"o.te<,) and the other from the differentiation of 
hence we have that 


n 
u 


OC 


1 Ova 1 


Riki: kylas- 
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We thus see that the differentiation of Ri:,... tyjas introduces terms 


of the type 
8 
--kylap 


and accordingly we should like to have the expression for the derivative 
of a term of this type. Equations (6.11) and (7.6) enable us to obtain it; 
here again there will be a cancellation, the second term of (6.11) annulling 
one of the terms introduced by the differentiation of R. Observing that 


Riel... = Rikyky - ky_,0as, We are able to write 
1 N-1 


OC 
(7.7) cur Rth.. = 08 
Ou 


Rin. +1 kip 1..-kwlaB 


[k1..-kwlrpe 


Either from equations (7.4) or directly, we have the similar result that 


=~ 


(k1...kw] 


Finally, we observe that we need a formula for the derivative of 


Olea 

This we obtain from a combination of equations (6.11) with either (7.7) 
or (7.8). Here again the term arising from the second term of (6.11) will 
cancel the one arising from the last term of equations (7.7) (or (7.8)). The 


result therefore is 


Cea 
Ath. + 26 —10) 48 


t=1 


Roo 
cur 
Ou 
+ 1 R 
2 du® 
(7.8) 
= 
2 du® 
‘ = 2cu 
OCea OC 
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The formulas here obtained, namely (7.6), (7.7), (7.8) and (7.9), indicate 
the general character of 0"c,s/(@u°)™ and lead us to write the following 


expression : 
O"Cas = (m) 
= 2Roas0, + 
(du°)™ o,N 
(m) 
+ au? { (m 1) Row so, (m—3) + 
7.10 
OCra — (m) 
TN 
O6ra (m(m — 3) 


where the D’s, E’s, and F’s are constants to be determined, and where )-7, 
represents the sum over N =2, 3, 4 - - - and o;=0, 1, 2, - - - and such that 
2N+o,+0.+ --- +ov=m. 

Since 0"¢.s/(du°)™ is symmetric in a and 8, the right hand side of (7.10) 
must be also, and independently of the R’s; this gives us conditions which 
by virtue of equation (7.4) become 


(m) (m) 
(m) 
(7.11) En... = 
(m) 


If we form 0”*'c,3/(du°)"*! by differentiating (7.10) with respect to u°® 
making use of the four fundamental formulas (7.6), (7.7), (7.8) and (7.9) 
and of equations (7.4) and (7.11), and compare the result with the equation 
obtained from (7.10) by writing m+1 for m, we obtain the justification for 
the precise numerical coefficients of (7.10) and at the same time we find 
recursion formulas for the D’s, E’s, and F’s. To facilitate this differentiation 
and the collection of the results, we observe that there are three types of 
terms in (7.10), those in which 0c,,/0u° does not appear, those linear in 
these derivatives, and finally, those quadratic in them. Those of the first 
type can arise only from those of the first or second type, those of the second 
type arise from all three types, but those of the third type arise only from 
those of the second and third types. With these observations the recursion 
formulas follow immediately from equations (7.6) to (7.9); they are 
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(m) m-3 3 
- pint Pigs 2(m — Soy y 


(m) 3 0 


(7.12) + + 2(m — Soy 
(m) 


m m (m) 


(7.13) m—4 0 0 
+-m(m — + 


(m+1) (m) (m) 


(7.14) (m) 
+ 4 
where the 6’s are defined by equations (1.4) and where a D, E, or F with 
a negative subscript is to be interpreted as zero. 
Summing up these results in the form of a theorem, we have the following: 


The mth derivative of Cag with respect to u° is expressible as a non-homo- 
geneous polynomial of the second degree in the first derivatives Oc,/du° 
(A, w=1,2,---,) whose coefficients are polynomials in the Riemann tensor 
and its covariant derivatives of orders less than and equal to m—2. 


(m) 


8. The fundamental tensors in questions, cag, are the derivatives 
evaluated on the hypersurface u°=0. We recall the definition of Q.s and 
0% from equations (5.2) and (6.5) respectively, and the relations (4.10) 
and (4.11) between gas and Cas. We observe further that if Risat.e-- + 
is the R in a general coérdinate system, we must have 

Ox* dx! 


where £'=0x‘/du° is the normal vector to V,. Hence this theorem follows : 


The fundamental tensors of a hypersurface are expressible in terms of the 
first two fundamental tensors of the hypersurface and of the Riemann tensor 
of the space and its covariant derivatives, evaluated on the hypersurface. 


For reference we shall give the explicit values of the first five of these 
tensors, the first three being obtained directly from equations (6.4), (6.13) 
and (6.15) and the others by repeated application of the recursion formulas 


m+1 
.. 
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(7.12), (7.13) and (7.14)*: 


Cap = Roago + 


@ 2 
Cap = TT Roaso,o — rT (RoayoQs + Roypo2a) 


1 


— + , 

1 

Cas = t + 14RoaoRous0,0 + 
— 80% (Roxs0,00 + 2RouoRorso) — 805 


» 
+ + 202.2% Rorno,0} . 
For m=6, we find 


@ 1 
Cap = + + 


+ + 32Ro 


10 » , 
(Rou so,000 + 3RopoRorp0,0 + 


10 » , 
+ 3RopoRora0,0 + SRoyo,oRora0) 


1 
+ (36Ror + 32RoroRoryo) - 


Part III 


9. We finally turn to the consideration of the tensors, and to obtaining 
some general properties. We have already observed in § 5 the geometrical 
interpretation Bianchi obtained for the functions Q.s. We shall here give 
another, one which has its analogue in 3-dimensional differential geometry. 

Consider the hypersurface generated by the geodesics of V4, tangent 
to V, at an arbitrary, but fixed point P of V,; such a hypersurface we call 
the tangent geodesic hypersurface to V, at P. We prove the following theorem : 


* These values of c\) were found by me first by making use of general codrdinates. However, 
at the suggestion of G. Y. Rainich I made use of the particular coérdinate system introduced in 
§4, and that not only enabled me to obtain my previous results more readily, but also made possible 
the determination of the general expression for os) (equations (7.10)). 


[October 
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If V,, is the tangent geodesic hypersurface to V,, at a point P and if P’ is 
a point of V, neighboring to P, then the distance from P’ to V, is given, except 
for terms of higher order, by —}eQasdurdu’. 


The proof is very direct. Let us find a parametric representation of the 
tangent geodesic hypersurface. The geodesic through the point with co- 
ordinates and direction (¢°, ¢, - - -, ¢*) at this point 
has for its equation, when the codrdinates of the space are the w’s, the 
infinite series 


where the functions T are formed with respect to the linear element (4.9) 
in precisely the way in which the I'’s of equations (2.10) are formed with 
respect to the linear element (1.1) and are evaluated for the — 10). 
If we take for the point P with codrdinates (0, » Uo) 
and for the direction one with components (0, ¢', - - - , ¢*), the 
is tangent to V, at P. When we set v* = {*s, equations (0.1) become 


1 
w= — { py} 31 T's — 
(9.2) 


a lia 


These equations, when we regard 2v!', v”, - - - , 7" as parameters, define the 
tangent geodesic hypersurface. Since the u° codrdinate of a point (w°, 
u',---,u") is the distance from the point to the hypersurface V,, the 
required distance, D, from a point P’ of the tangent geodesic hypersurface 
to V, is given by 


D = —4 {6,} 0A”. 
But v* is du® except for terms of higher order; furthermore, by virtue of 
equations (4.8), {,°,} =eQ,,. Hence 
(9.3) D = 


This proves the theorem. We may make the following further observations. 

The contact between a hypersurface and its tangent geodesic hyper- 
surface is in general simple. A necessary and sufficient condition that it be 
of order higher than the first at a point P and in the direction ¢ is that 


(9.4) = 0 
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at P. If (9.4) holds for every direction and at every point of the hyper- 
surface, we have 

Qap = 0. 
We observe that hypersurfaces for which 2,.3=0 may be classified by 
means of the equations 


= = = 0, ~ 0. 
The larger the value of m for which this is true, the more nearly will the 
geometry of the space and the hypersurface be the same. In general a 
space does not contain any hypersurface for which & =0 for all m. We 
shall not at present carry the problems here suggested any further; instead 
we shall consider, in the remainder of this paper, a few special cases. 

10. First suppose the form (1.1) or its equivalent (4.9) is reducible to a 
form with constant coefficients. Such a space we call a flat space. We observe 
that a flat space with a positive definite form is euclidean. 

For a flat space, Ri;,: and its covariant derivatives vanish identically ; 
from (6.13) it follows that c%=0 for m=3. For cy, m<3, equations 
(8.2) become 


(10.1) Baby Cog = — Wap, = 


C4 
Bianchi* has shown that 
ati afi 
du? 


where the f’s are those of equations (4.1), [rs, ‘], are the Christoffel symbols 
of the space with reference to the form (1.1) and &* is the vector normal 
to V, referred to the general coérdinate system of (1.1). 

If the space is flat, we may take generalized cartesian codrdinates, 
that is 

0 (i #79), 
(10. 3) 1 (i = 0,1, ---,m), 
=-1 G =m+1,---,m). 

Then we have 


m gti N gti 


OU% oud j=m+1 dub 


* Bianchi, loc. cit., p. 360. 


afi aft 
— — 
iz 
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and hence 
N ati agi 


jom+10U% 


(10.5) 


This is precisely the coefficient of the third fundamental form of the 
hypersurface for euclidean space* in which m =n. 

If V, is a hypersurface geodesically parallel to V, and if we denote by 
Zes, Das, rp the coefficients of its three fundamental forms, we have from 
equations (5.3) that 


= £248 20a 8¢ + Qa 
(10.6) = — 


af 


where c is the constant distance from V, to Vn. We observe 


In a flat space, hypersurfaces geodesically parallel have the same third 
fundamental form. 


The last two theorems of § 12 hold for flat spaces but as the proof for 
the special case is similar to the proof for the general case we shall give 
only the latter in the later section. 

11. Let us next consider the case when V, is such that Rjjx1,n=0. 
This group of spaces contains as a subset the spaces of constant Riemann 
curvature; for if the curvature of V,,, has the constant value of Ko, thenf 


(11.1) = — 


and from this it follows that Ri;x:,..=0. We can give a geometrical inter- 
pretation of the vanishing of R;;x:,m by means of the following theorems. 

Let A‘(¢=0, 1, 2,---,m) be the components of an arbitrary contra- 
variant vector, defining the congruence of curves whose differential equa- 
tions are 


Further, let \,);‘ and A2)' be the contravariant components of two vectors 
arbitrary except that each forms a family of parallels along and with 
respect to the curves of the given congruence, in the sense of Levi-Civita. 


* Bianchi, loc. cit., p. 474; cf. Eisenhart, loc. cit., p. 219, for the general flat space. 
+ Bianchi, loc. cit., pp. 75, 343. 
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Then 


A necessary and sufficient condition that the Riemannian curvature of 
(1.1) be constant along the curves of the given congruence and for any pair of 
directions of the type of dy and dz is that Rijxi,m" vanish. 

Before we prove this, let us observe that an immediate consequence of 


it is the following theorem: 


If the curvature 1s constant along the curves of n+1* independent con- 
gruences in arbitrary pairs of directions which remain parallel, then Rijxi,m 
vanishes. 


Let us return to the proof of the first theorem. The Riemann curvature K 
in the directions and A2,‘ when and Az; are orthogonal unit vectors, 
is given byt 
(11.2) K=+ 


If each of the directions \,, and \.; forms a system of parallels, in the sense 
of Levi-Civita, aiong the curves of a congruence defined by a vector \' 
we must have thatt 

(11.3) heidi = 0 (c= 1,2; i=0,1, -m) 


where A,;*,; is the covariant derivative of \,,‘ with respect to x’ and the 
fundamental form (1.1). Differentiate (11.2) covariantly with respect to 
x”, multiply by \” and sum for m; because of (11.3) we obtain 
OK 
If A is constant along the curves defined by X‘, (0A /dx™)A" must vanish 
and since \,; and A. are arbitrary, we have from (11.4) and (1.12) that 
(Rijkt.m + Rx jit,m)X”™ = 0 (i,j,k, = 0,1,2,-- 
Since this holds for all values of 7, 7, k and /, we have also 
— (Rikijm + = 0. 
Furthermore, from (1.11) we have 
(Rijctsm + + = 0. 


* Note n+1 is the dimensionality of the space. 

ft Bianchi in deriving this formula does not assume that A, ii and )2\‘ are unit orthogonal vectors, 
so his value of K, loc. cit., p. 343, differs somewhat from (11.2). In what we do here, it is no restric- 
tion to make this hypothesis. 

t Levi-Civita on Parallelism, Rendicontidel Circolo Matematico di Palermo, vol. 42 
(1917), or Bianchi, Rendiconto della Reale Accademia delle Scienze Fisichee Matemati- 
che di Napoli, ser. 3, vol. 27 (1922). 
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Add these three equations, and we find 
(11.5) = 0. 
Conversely, if (11.5) hold, from (11.4), (@A/dx")\" vanishes, and K is 
constant along the curves defined by AX‘ in directions parallel with respect 
to these curves. This proves the theorem. 

For a space satisfying the condition of the second theorem* the 
sequence of tensors (8.2) becomes the following: 


(2) » 
Roago + QayQ;, 


31 (Roas0®, + 


M A2 Am—2 \m—1 m— 194 


(m= 2, 3, ). 


For the more restricted case of spaces of constant curvature equations 
11.1) hold and in the coérdinate system of (4.9) we have 
= a - a 
Roago = — eK ogas, Rogo = — eK ob. 
Hence (11.6) become in this case 
Kav 


) J2m-1 ) 
(2m >m—1 (2 
(2m) ! 
J2m+1 
= —(—e)™ Ky Gas 
2m ! 


The summation of (5.3) can be effected quite easily; we obtain 
1 


Qag sin (2 /eKo u°) 
V 


Cas = Sap 
(11.8) 
{1 — cos (2v/ u®) 


* That is, if Rij k1,m=0. 
7 If eX» has a negative value, (11.8) must be replaced by a similar formula with hyperbolic 


functions instead of trigonometric. 
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(3) 

(11.6) 

(2m) 

(2m) ! 

(2m+) 
(11.7) 
(m =1,2,---). | 
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Hence we may state the following theorem : 

In a space of constant curvature, Ko, the fundamental tensor of a hyper- 
surface geodesically parallel to and at a distance u° from an arbitrary hyper- 
surface is given by (11.8). 

By differentiation of (11.8) we obtain that for hypersurfaces geodesically 
parallel 

1 — 
Qas = Qag cos (2 eKy — sin (2\/ 
(11.9) 
= Qag eKosin(2V/ eKo + (2\/ eKo 


We shall return to the consideration of these equations at the end of 
the next section. 

12. Let us finally consider a special type of hypersurfaces, which, in 
one sense, are analogous in the general space to spheres and planes in 
euclidean space. Bianchi has shown* that the curvature, 1/R, of the 
geodesics of V is given by 

1 


Rs gagdu*du® 


The maxima and minima of 1/R are given by the determinantal equation 


1 
(12.2) | — = 0, 


and the corresponding directions, called the principal directions, are solu- 
tions of 


1 
(12.3) ( = 0 


There are m sets of principal directions corresponding to the n roots oi 
(12.2); the m congruences of curves whose directions at every point co- 
incide with the principal directions are the lines of curvature; 1/R is called 
the normal curvature of the hypersurface. 

It follows immediately that a necessary and sufficient condition that 
the lines of curvature of a hypersurface be completely indeterminate is 
that Q.3=(1/R)gas for all aand $8. These hypersurfaces are, in one sense, 
generalizations of spheres and planes; more particularly if 1/R=0 we 
have the type of hypersurface mentioned in §9. We shall prove the fol- 
lowing theorem : 


* Bianchi, loc. cit., p. 366; Eisenhart, loc. cit., p. 151. 
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In a space of constant Riemann curvature, the hypersurfaces whose lines 
of curvature are completely indeterminate have constant normal curvature and 
constant Riemann curvature; conversely if a hypersurface of space of constant 
Riemann curvature has constant Riemann curvature, then in the enveloping 
space it has constant normal curvature and completely indeterminate lines 
of curvature. The Riemann curvature of such a hypersurface is equal to the 
sum (or difference) of the square of its normal curvature and the Riemann 
curvature of the enveloping space.* 

The proof is an immediate consequence of the equations of Gauss7 
connecting the first and second fundamental forms of a hypersurface. 
For spaces of constant curvature, Ko, these equations are 


(12.4) — = Rasys — — 


where Ragys is the Riemann tensor formed with respect to the quadric form 
(4.3) and e is defined by (4.9). If the hypersurface has indeterminate lines 
of curvature, 2,;=(1/R)gas and consequently (12.4) become 


(12.5) Rasys = ( R > Ke)( — Br ). 


From the theorem of Schurf it follows that e/R*?+ Ko is constant, and 
from (11.1) it is the value of the Riemann curvature of V,. Conversely, 
if (12.5) are satisfied, from (12.4) it follows that every two-row determinant 
from the square matrix ||Q.s|| is equal to the corresponding two-row de- 
terminant from ||g.3/R||. Furthermore, if 1/R#0 there is a three-row 
determinant from the latter matrix different from zero, since the deter- 
minant |g.s|~0. It then follows from a theorem due to Killing§ that the 
elements of the two matrices difier at most in sign; that is 


1 
(12.6) Qas =+ (a,B 1,2, 


If 1/R=0, from (12.1) we have that Q2,.,=0 and (12.6) hold in this case too. 
This proves the theorem. 

We observe that e is necessarily positive if the form (1.1) or its equivalent 
(4.9) is positive definite; hence we have the following immediate corollary : 


In spaces of more than three dimensions of constant Riemann curvature, 
Ko, whose fundamental form is positive definite, there are no real hypersurfaces 
of constant curvature K < Ko. 

* The converse theorem holds only for n>2; the direct theorem however is always true. 

t Bianchi, loc. cit., p. 362; Eisenhart, loc. cit., p. 150. 


t Mathematische Annalen, vol. 27, p. 563. 
§ Nicht-Euklidische Raumformen in Analytischer Behandlung, Leipzig, Teubner, 1885, pp. 236-237 
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As a special case we have the well known theorem that in euclidean 
4-space there are no 3-spaces of constant negative curvature. 

If equations (12.6) hold, we find from (11.8), by virtue of the first 
of (11.7), that cas is proportional to gag and hence the map between V, 
and V, established by the geodesics of Vay; normal to V, is conformal. 
Conversely if a hypersurface is mapped conformally on all the hypersurfaces 
geodesically parallel to it, from (11.8) we obtain that Q.3 and c® are pro- 
portional to gag. We observe further from (11.9) that, if Qas=(1/R)gas, 
we have that Qa3=(1/R)cas- Hence we have the following theorems: 


If, in a space of constant curvature, a hypersurface is mapped conformally 
on the hypersurfaces geodesically parallel to it, the map being established by 
the normal geodesics, then these hypersurfaces have completely indeterminate 
lines of curvature. 


The hypersurfaces geodesically parallel to a hypersurface with indeterminate 
lines of curvature in a space of constant Riemann curvature are themselves 
hypersurfaces wilh indelerminate lines of curvature. 


Because of the first theorem in this section we may restate the preceding: 
In a space of constant Riemann curvature, a hypersurface geodesically 
parallel to a hypersurface of constant Riemann curvature must also be of 
constant Riemann curvature. 
In a flat space hypersurfaces with indeterminate lines of curvature must 
be hyperplanes or hyperspheres, and so the last two theorems are trivial. 
From (12.3) we readily obtain that 
(12.7) Qaprrj = 0, = 0 (h,k =i, 3h k), 
and from (11.8) and (11.9) by virtue of (12.3) and (12.6) it follows that 
= 0, 


Equations (12.7) are also sufficient conditions that the congruences A,\* 
(kh=1, 2,-+-,%) be lines of curvature, for if in (12.7) we let h be fixed, 
and k take on the values 1, 2, - - - , m, #h it follows that the vector Qasdaj* 
is normal to the m—1 vectors A,; hence Qagdn,;*=pAn 3 OF An, is a line of 
curvature. Consequently we have the following theorem, which holds also 
in euclidean space: 


In a space of constant curvature the lines of curvature of geodesically 
parallel hypersurfaces corres poitd. 
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A COMPARISON OF THE SERIES OF 
FOURIER AND BIRKHOFF* 


BY 
M. H. STONE 


In his thesis, Haar has made a comparative study of developments on 
(0, +) in terms of the set cos x, V2/mcos 2x, - - - , and of acer- 
tain Sturm-Liouville set. He proves that the term-by-term difierence of the 
formal series for any function summable with summable square converges uni- 
formly to zero on the interval (0, 7)t. Recently, Walsh, by entirely different 
and elegant methods, deduced a similar theorem for the set /2/" sin x, 
4/2/m sin 2x,---, and a second Sturm-Liouville set on the interval (0, 
m)t. These two papers naturally suggested the possibility of analogous 
theorems for the series which Birkhoff defined in a memoir on the expansion 
problems of linear homogeneous differential systems of the mth order, 
and which are generalisations of the Sturm-Liouville series.§ Tamarkin, 
in a paper written to supplement the one just cited, compared the general 
Birkhoff series on the one hand and Fourier series on the other.|| He dis- 
cussed only Riemann integrable functions and did not investigate the actual 
term-by-term difference of the series under consideration. In a later work 
on expansion problems which came to our attention after the completion 
of the present paper, he obtained, by methods different from our own, 
certain of our theorems, which will be noted subsequently.{{ Insofar as 
Tamarkin’s treatise concerns us here, it consists in an extension of his 
previous results to the case of the summable function. 

In the present paper we study the comparative properties not only of 
the Birkhoff and Fourier series for an arbitrary summable function, but 
also of the formal series obtained by deriving these term-by-term & times. 


* Presented to the Society, December 30, 1924; accepted in partial fulfillment of the require- 
ments for the degree of Doctor of Philosophy at Harvard University; received by the editors of 
these Transactions in October, 1925. 

t Haar, Mathematische Annalen, vol. 69 (1910), pp. 331-371. 

t Walsh, Annals of Mathematics, ser. 2, vol. 24, pp. 109-120. 

§ Birkhoff, these Transactions, vol. 9( 1908), pp. 373-395. 

|| Tamarkin, Rendiconti del Circolo Matematico di Palermo, vol. 34 (1912), pp. 345-395. 

§] Tamarkin, OnCertain General Problems in the Theory of Ordinary Linear Differential Equations 
and the Expansion of an Arbitrary Function in Series, Petrograd, 1917, in Russian: cited henceforth 
as D. E. 
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While our main purpose is the investigation of these questions, we 
first make generalisations of the series defined in Birkhoff’s memoir. These 
generalisations are not trivial, and furthermore occupy a central position 
in the theory of linear differential equations. For this reason we take the 
liberty of describing them here. 


J. EXISTENCE THEOREMS 


In the following pages we shall have constantly before us the differential 
equation 


where ps, - - - , Pn are real or complex summable functions of the real variable 
x on the interval (0, 1), and \ is a complex parameter. The usual existence 
theorems are obviously inapplicable to the present case. We employ a 
modification of the method of successive approximations to show the 
existence of “‘solutions” of such equations.* We first prove 


THEOREM I. Jf 

(1) pi, +++, pn, 7 are real or complex summable functions of the real variable 
x,asxsb; 

(2) Xis a complex parameter restricted to the circle <A; 

(3) xis any point of (a, b); 

(4) Co, +++, Cn. are any real or complex constants; 

then there exists a constant 5s>0, independent of Xo, Co, Cn—1; and 
there exists a function u(x, ), defined for all d, |A\ SA, and for all x on the 
interval common to (a, b) and Xo +64), such that 

(1) X) is continuous in x and analytic in X, k=0,--- , n—1, the 
differentiation being with respect to x; 

(2) A) k=0, -- n—1; 

(3) u(x, A) is summable in x for each value of d; 

(4) (x, dX) (x, d)dx; 

(5) + +(pratrA)=r, except possibly on a set 
of zero measure, Ey, for each d; 

(6) u(x, r) is unique. 

In the case =~2= -- =~, =0, A=0, the equation reduces to u™ =r; 

and we can construct a function with the properties (1)—(5) of the theorem 
for the whole interval (a, b). By direct verification we see that it is 


2 (n — 1)! (n — 1)! 


r(é)dé. 


u = Co 


* Tamarkin has obtained existence theorems here differently, D. E., Chapter I. 
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Hence in the general case we attempt to solve the equation 


xo) Cn—1(X Xo) +f (x — é) 


2 — 1)! [S(u) + 
S(u) = — — --- — — (p, + Adu. 
We first form the sequence of successive approximants [”;] where 
2 (nm — 1)! 
(x — 


= Up + [S(us) + r(é) ]dé. 


uo co 


To consider the convergence of this sequence we construct a second sequence 
where 


% = 


1= = (n— 1)! [.S (10) r|dé, 


Vit = = p dé (¢ = 1,2 ) 

i+1 i+1 7 1) Vi ° 
Clearly, for k=0, 1, 2,---,n—1 andi =1, 2,---, and for all x, xo, on 
(a, b) we have 


®| <C, < 
sc, 


— 
i+1 k—1)! (v;)dé 


Next, by a fundamental property of the Lebesgue integral, we can write 


1 
| )dé} — 


for all X, |A| <A, and for all x, |x—2o| S64, where 54>0 does not depend on xo. 

We now show that if the inequality |v,|<C/2* is true, |x—xo|<é,, 

it is also true when 7 is replaced by i+1; since it holds for i=1, it will be 

true for allz. We have 

Cc 
3 


f +++ +] + [A] > 


xo | 


7 
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Hence the infinite series +2.°%+ ---,k=0,---,n—1, converge 
uniformly, |«—2o|<6,, |A| SA. The individual terms are continuous in 
x and polynomial in }. We can thus be sure that the limit functions are 
continuous in x ard analytic in A, and can write 


lim = limus® = u(x,d). 
ime a=l 
The functions “‘"(x, \) therefore satisfy conclusion (1) of our theorem. 
On allowing i to become infinite in the relation connecting uj4: and 1, 
we find 
u(x,r) = uo(x) +f ————~ [S(u) + r]dt, |x — xo| S da. 
(n—1)! 
By direct computation we see that (2) is fulfilled. On deriving the identity 
n times with respect to x there results 
u\™(x,r) = S(u) + r(x) 
except possibly on a set Z, of measure zero. Thus (3) and (5) are satisfied. 
Then we see that 


= + f [s(u) + 


= +f u(x ,rA)dx 
7 


which is conclusion (4). 

We now assume that on some interval containing x there exists a second 
function a(x, X), |A{ SA, with the properties 
(1’) A), k=0, - - - , m—1, is continuous in x for each value of 
A) R=0,-- , n—1; 

(3’) w(x, X) is summable in x for each value of A; 

zero measure. 

The difference U =u—d is then a function satisfying the conditions (1’) 
-(5’) with ¢,=0, k=0,---,2—1, and r(x) =0, the set being replaced 
by the set E,’"=E,+£,’. We suppose that at some point (Zo, Ao) in the 
domain of definition of U(x, \) we have U (Zo, Xo) #0. 

By the part of the theorem already proved we now construct functions 
, Un, satisfying the conditions (1)-(5) with (Zo, A) 
the well known Kronecker symbol, and r(x) =0. We then form 


1926] SERIES OF FOURIER AND BIRKHOFI 


with W(x, A) =0, W(%o, A) =U (%o, A) #0. It is easily seen by the use of 
(5) and (5’) that we have 


W"(x,d) = — pi(x)W(x,d) 
almost everywhere, while by (3) and (3’) it is clear that 


W(x,d) = f W'(x,d)dx. 


On the other hand the function 


x 
Pid. 
W(x) =e 
satisfies 


Wé (x) = — pi(x)Wo(x) 


almost everywhere, and 


Wo (x) = 1 +f Wo'(x)dx. 


We then find that (W/W.)’=0 almost everywhere; and we can integrate 
this relation obtaining W=c(A)Wo. Since W(x, 4)=0, it follows that 
c(A) =0, W(«, \)=0. This leads to the contradictory statement W \) 
= U(#o, \)=0. We thus have U(x, \)=0 over the domain for which it is 
defined; in other words, u(x, \) is unique, as asserted in the sixth point of 
the theorem. 

It is now easy to demonstrate the general existence theorem, which is 


THEOREM II. Under the hypotheses (1)—(4) of Theorem I there exists 
a unique function u(x, \), a<x <b, such that 
(1) w(x, X) is continuous in x and entire ind, k=0,---,n—1; 
(2) u™ A) R=0,---,n—1; 
(3) u(x, X) is summable in x for each value of d; 
(4) u(x, A) +f (a, d)dx; 
(5) uM+ pur) + ++ + +(prtrA)u=r, except possibly on a set 
E of measure zero, independent of X. 


At x» we form the interval Io: |x—2o|<5,; at the right-hand end of To, 
xo+6,, we form the interval J,: |*—2o—6,| <6,; and we continue, forming 
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I;, - - + , Iv, where N is large enough that Jy contains x=). Similarly, we 
form a chain of intervals J_,, - - - , 7-» running to the left, with M large 
enough that 7_ 4 contains «=a. Then in J, we use Theorem I to set up a 
function u(x, ) satisfying conditions (1)—(6) of that theorem. In J; we 
apply Theorem I to define a function w(x, \) satisfying (1)—-(6) in 4, with 
(xo+5a, A) (xo +64, A); by (6), w and m coincide, x» Sx 
<A. We then perform a similar set of operations in - - 
I_y. The result is the formation of the function u(x, \) whose existence is 
asserted in the present theorem. 

For this function properties (1)—(4) are immediate. In order to prove 
(5) we start from the fact that for each \ the differential equation is satisfied 
except on a set E, of zero measure. We denote by X’ the points of the A-plane 
with rational coérdinates. We then define the set of zero measure 


Writing 


P(x) = f (| +1 


we denote by €, the set of zero measure on which P’ does not exist, is infinite 
or does not coincide with the integrand in the definition of P. We show that 
we can set €E=€,+€:. We let x’ be any point of (a, ) not in €, \ any point 
of the complex plane, and )’ a point sufficiently close to \ that for all x on 
(a, b) and for k=0, - - - , n—1, the inequalities 


| u(x,r) — u®(x,d’) | <e, !du(x,d) N’u(x,d’) | <e 
are true. We find 


1 z=2'/+h 1 z=z'+h 
lim’sup ; | | 
1 


1 
lim sup | pu ba 
h-0 h z’ 

P(x’ + h)— P(x’) 


< elim =e€P’(x’). 
h 


1 z=2z'+h 
lim | = — pul) —---— pu — 


z= z’ 


and since e can be taken arbitrarily small, we must have 


1 z=z'+h 
lim | = — 
h 


as we were to show. 


Since 
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All the usual discussions of linear dependence and independence of 
solutions, solutions satisfying given boundary conditions, Green’s function, 
and so forth, can now be repeated for the general equation described in the 
theorem.* We may note that a consideration of the linear dependence of 
solutions would lead to a sharpening of Theorem II in that the set € could 
be replaced by a set E, independent of the particular solution. 

We turn next to another type of theorem. While that which we shall now 
discuss difiers but slightly from a less general theorem of Birkhoff in its 
proof, we take the opportunity of giving a complete demonstration since 
the details of the proof can be simplified for our purposes.t 

We denote by S a sector of the type /7/n< 6 (/+1)x/n, arg p=8@, 
where /=0, 1, 2,---,2n—1. From the sector S we define a region T by 
an arbitrary translation taking the vertex into the point —c, where c is a 
complex constant. The m distinct roots of w*+1=0 can be written in the 
order w;, - , such that for all p on T 


R(p + cor) S R(p + cor) --- FS R(p + cwn) 


where the notation R( ) means, as usual, “the real part of”. We commence 
with two important lemmas. The first is 


LemMA I. On the region T of the p-plane 


< C | |, 


> Wee? 
dx* | 


a=n 
weer < C| p|*(n i) | | 
a=i+1 


fork=0,---,n—1,i=1,---,. The constant C may be taken independent 
of i and k. 

We let C be chosen so that 

| | C, a=1,---,i,i=1,--- 

| (2-8) | 2C,a=i+1,---,n,i= 
Then we have for a S$ and for p on T 


R(pwa) S R(pw; + cwi — wa). 


* For the usual existence theorems and subsequent developments, see Bécher, Legons sur les 
Méthodes de Sturm, Paris, 1917. 

{ Birkhoff, these Transactions, vol. 9 (1908), pp. 219-231. 

t Birkhoff, these Transactions, vol. 9 (1908), p. 381. 
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Thus, if 


a=i 


wa < | p|* >| | < | p| k | 
dx* a=1 a=1 a=1 


C| p|*i| | , 


as we were to show. 
Again, starting from the inequality R(pwe)>R(pw;+cwj—Wa), aZi, 
which is true on 7, we derive the second inequality by analogous means. 
In Lemma II we shall employ heavy-faced type to denote square matrices 
of x? elements.* The statement of the lemma is as follows: 


Lemma Il. JfQ(x) has all its elements real or complex continuous functions 
of x, OSx1, and 2f K(x, &, p) has all ats elements of the form p(&)k(x, &, p) 
where 
(1) p(é) 2s a real or complex summable function, OSE<1; 
(2) k(x, &, p) ts continuous in x and & for OSE<x and for OSx<ES1, 
and is analytic in p, |o| =RounT; 
(3) |k(x, p)| Sk, OSTxS1, OSES1, |p| on T; 
then there exists a matrix Z(x, p),OSx<1, |p| sufficiently large on T, 
whose elements are continuous in x and analytic in p, and which satisfies 


1 1 
Z(x,p) = W(x) + =f K(x, 
PJ 


Furthermore, 
1 
Z(x,p) = Q(x) + — E(x,p) where | E(x,p)| = M, 0 
p 


pon T. 


If U denotes the matrix all of whose elements are unity, we can write 
1 
sot, f | pu. 
0 


We consider the infinite series 


Z(x,p) = + 


* For further details of notation see Birkhoffa nd Langer, Proceedings of the Ameri- 
can Academy of Arts and Sciences, vol. 58 (1923), pp. 52-54. 


1 1 
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We see that the general term is in absolute value less than or equal to 


Thus if we take np/|p| <0@<1, the series converges absolutely and uniformly 
in x and p. The individual terms are continuous in x and analytic in p; 
hence this is true of the elements of Z. By direct computation with this 
series we find that 


1 1 
Z(x,p) = Q(x) + - f 
#0 


We now let m(p) be the maximum value taken on by the elements of 
Z(x, p) for OS$x<1. Then we have |Z(z, p)| <m(p)U. From the integral 
equation 

1 nm(p) 
| Z(x,p)| < wU + —— pm(p) UV? = U. 
|p| 
All the elements of the matrix on the right are equal; there is always some 
element of the matrix on the left which for an appropriate value of x takes 
on the value m(p). Hence 
npm 
m(p) Sw + |p| = R’. 


| P| 
Now we find the inequality 


1 
|2(2,0) —a(2)| < 
|p| |p| 
from which the rest of the lemma follows. 
We come now to 


THeorEM III. Jf fo, ---, pn are real or complex summable functions of 
the real variable x, 0 <x <1, and if pis a complex parameter, then the differential 
equation u™ + + + + (prtp")u=0 has on any 
region T of the complex p-plane, n linearly independent solutions analytic in 
P, M1, °° * , Un, expressible with their first n—1 derivatives in the form 


E; 
p 


where |Ex (x, p)| <M for allp on T, |p| 2R, and for all x, OSx 


= 
_ 
U. 
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We write the differential equation as 
u™ + p"u Prt S’(u), 


and proceed to solve this equation as though the right-hand side were known. 
Thus 


0 


where ¢i, - - - , ¢, are arbitrary constants. We now write 


= C1 (/=1,---, 4%), 


a=i+l 0 np 


1 Wal Peat 


ll 


S’(u)dé =i+i1,---,m). 


We find at once 


1 z 


aml 


1 z a=n 
+— f DX 
1 


np"—} 


a=i+l1 
Next we let correspond to &=1. 
We then put =p*er*#2, fori=1, -- Thenitis 
apparent that 
1 d* a=t 
np*— dx* 
1 da* a=n 
np™—} dx* oni 


On expressing S’(u) in terms of the z,, this equation becomes 
p q 


z 


1 a* a=i 
ik = wi* + — a + 


np. a=1 


1 z 
mp 


dx* a=i+l1 
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By Lemma I we see that this set of equations is precisely of the sort discussed 
in Lemma II. Thus we obtain solutions ™, --- , such that p) 
is continuous in x, 0<x<1, and analytic in p, |p| >R, on T, for i=1, - - - , 
n,k=0,---,n-—1, while 


Exx(x,p) 


= ad + 
p 


Ex| < M. 
The functions ™, - - - , #m are clearly linearly independent for large |p|, by 
their asymptotic form. 

It remains for us to show that the m,--- , u, satisfy the differential 
equation. We know that there exist solutions %, - - - , #, of the differential 
equation, linearly independent for all values of p; we need only apply 
Theorem II taking (0) =6;44:, 7=1,--- , m, R=0,---,n-1. By 
assigning appropriate values, analytic in p, to the coefficients 4,--- , & 
of the integral equation above, we obtain particular equations satisfied by 
th, +++, tn. We show that any solution w for a similar equation in which 
are constants can be expressed linearly in terms of - , 
and is consequently a solution of the differential equation. We write 


U = u — — +++ — Clin, 


1 z 
U = Ayers? +--+ + + f dé 
Jo ant 
1 z a=n 
dé. 
ami+1 
We see now that either (1) when the constants &,--- , & are not all 
zero, we can determine A;,-~-- , A, identically zero by an appropriate 
choice of Ci,--+-, Ca; or (2) we can take w=0 and determine Ci, - - - , 
C, not all zero so that Ai, ---, A, are identically zero. Either possibility 
leads us to study the equation last written down with the coefficients A, 
+, A, taken equal to zero. On writing U‘* Z, we find 


= —— n + 
k 4 p k a pop n—2 


a=1 


+ prZo} dé 
1 z d* a=n 
+ f > { pop” + 
mp") Jy 


peZo} dé. 
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If m(p) is the maximum value taken on by |Z;(x, p)|, 0S«<1, k=0,---, 
n—1, Lemma I enables us to conclude that 


iif | Pn | 


For some values of x and k we can always write |Z,! =m/(p), so that 


fr 


This is impossible for large values of |p| unless m(p)=0. Thus U=0. Under 
the second of the two alternatives above we should have C,#,+ --- +C,a@, 
identically zero without having), - - - ,C, allidentically zero. By hypothesis 
this is impossible. The first alternative alone is possible. Consequently 
we have u=Cyi,+ ---+C,a, as we wished to show. We may remark 
that, since the coefficients &, - - - , é associated with « were assumed to be 
constants, we can assert that Ci, - - - , C, are analytic for all p of sufficiently 
great absolute value so that # is also analytic in p. This completes the 
theorem. 

We shall need in some of our work a slightly more detailed form of the 
theorem actually stated by Birkhoff. 


TuHEeorEM III’. Jf po,---, pn are real or complex functions of the real 
variable x, OSx1, continuous together with their derivatives of all orders, 
and if p is a complex parameter, then the differential equation 


Ul) te + pour?) + + + (pp + p")u = 0 


has on any region T of the complex p-plane n linearly independent solutions 


analytic in p, %1,°-° + , Un, expressible with all their derivatives in the form 
(k) Au(=) | Eix(x,p) 
ux (x,p) = 1 
pti 


where Ay(x) is continuous together with its derivatives of all orders and 
E(x, p) is bounded for all p on T, |\p|=R, and all x on (0, 1). The func- 


tions Ay(x) may be taken independent of the particular region T. 


To demonstrate this theorem we first determine a function 


l=m , 
uy(x,p) = em? { i+ > 


l=1 (pw)! 
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with the property that when substituted in the differential expression which 
is the left-hand member in our diflerential equation it gives rise to a series 
in powers of p beginning with a term in (pw,)""""* at most. Since the substi- 
tution of pw;/w; for p in the differential expression does not change it, the 


functions #;=%,(x, pw;/w;) have the same property. Taking %,--- , 
as approximants to #,--- , “%, we then apply the reasoning of Birkhofi’s 
paper to establish the theorem for k=0, - - -,2—1. For k=n we substitute 


the asymptotic forms thus obtained in the diflerential equation; for k=n+1 
we differentiate the equation once with respect to « and substitute the forms 
previously determined; continuing thus, we establish the theorem for all 
values of k by induction. Since the same functions #, numbered in accordance 
with the ordering of the constants w, can be employed for any other region 
T, the functions A y(x) are independent of the particular region. 

We may notice that, if we determine beforehand the integer and the 
greatest value of k which we wish to employ, the functions ps, ---, px 
do not need to be taken as continuous with continuous derivatives of all 
orders in applications of this theorem. For the sake of simplifying a dis- 
cussion sufficiently complicated in other ways we make this more restrictive 
assumption. 


Il. THE GENERAL PROBLEM 
The general problem which we shall discuss arises from a linear homogen- 
eous differential system of the zth order: 
Wi(u) = 0, 


0, O<x<1, 


0, 


where W,(u), - - -, W,(2) are linearly independent homogeneous linear forms 
in u(0),---, w-)(0), w(1), - - -, with real or complex constant 
coefficients. There is no loss of generality in restricting attention to the 
unit interval. As we have already pointed out in §I there is associated with 
this system a Green’s function except when the boundary conditions are 
identically satisfied by some solution of the differential equation; this 
Green’s function will have the essential properties of the Green’s function 
defined in the case of an equation with continuous coefficients. We now make 
the assumption that the Green’s function G(«, y; \) has infinitely many 
poles in the \-plane, i, Xe, - - - , which can be arranged so that |A\| <|» 
<---,lim,..|A,|=0. We let R,(«, y) be the residue at \=X,; considered 
as a function of x, this residue satisfies the differential system for \=X,. 
We next define a system of circles C;, C2, - - - , with centers at \=0 and radii 
Ax, As, - - - forming a monotone sequence with limit +. We may suppose 
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further that the region between two consecutive circles of the system contains 
at least one pole of G and as few others as possible. No pole shall lie on any 
of the circles C. For any function f(x) summable on (0, 1), and for that branch 
of the function [1 —(A*/A,*)]**! reducing to 1 at the origin, we consider the 
behavior of the integrals 


1 oF 1 r4 k+l 
1 — G(x,’ ; A)ddd 120, 
2ri dx* fof ( 


fro f \)dddy, 


as ve. By the theory of residues these expressions are respectively 


a=n, 4 k+l 1 k 
Ad 0 
> {1i- f R.(x,y)dy, 
0 


a=1 


a=ny 


1 
SO) Ray, 
a=1 0 

where n, is the number of poles of the Green’s function in the circle C,. 

To compare these expressions, suitably restricted in the case k ¥0, with 
corresponding expressions in the theory of Fourier series we construct a 
differential system of order m for which the integrals and series just described 
become Fourier series. We then study the difference of the corresponding 
expressions formed for the two differential systems. 

We shall, in the present paper, restrict ourselves to those differential 
systems for which the boundary conditions W,, - - - , W,, after being reduced 
to their normal form, are regular according to Birkhofi’s definition.* The 
differential system from which we obtain the Fourier series will be of this 
type. At a later time we shall come back to the irregular cases, n=2. 

Our fundamental tool for this investigation is a theorem due to Lebesgue.t 
It is sufficient for our purposes to quote it, with minor modifications. 


THEOREM IV. Let there be given a function (x, y, v) defined for x and y 
on the interval (a, b) and for real positive values of v belonging to a set N one 
of whose limit points is + ; and let @ be summable in y for each pair of values 
(x, v). Then a sufficient condition that, as v becomes infinite in any manner 
in N, 


lim f =0 


* For the terms normal and regular, see Birkhoff, these Transactions, vol. 9 (1908), pp. 
382-383. 
t Lebesgue, Annales de Toulouse, ser. 3, vol. 1 (1909), pp. 52-55. 
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uniformly for all x belonging to a set E on (a, b), f(x) being any summable 
function, is that 

(1) |o(x, y, v)| <M for all v in N and for all x in E, except for values of y 
on a Set E’ of zero measure; 

(2) lim, y, v)dy=0 uniformly for x in E,asa<Bsb. 


Lebesgue also shows these conditions necessary, but we shall not use this 
fact. 


III. Fourier SERIES 


We find in the following theorem a generalisation of facts well known in 
the cases n=1 and n=2: 


THEOREM V. The expansion problem associated with the differential 
system 


u™ + dru = 0, 


u-)(0) — u-Y(1) = 0, 
in which the boundary conditions are regular, gives rise to Fourier series. 


The regularity of the boundary conditions is a matter of direct computa- 
tion which we omit. For the case where m is even this has been carried 
through elsewhere.* 

We first determine the values of \ for which the differential system has 
solutions not identically zero. When A =0, it is clear that the only solution of 
the system is the solution «=constant. When \+0 we make the substi- 
tution \=p". Choosing » linearly independent solutions of the differential 
equation, #,;=e°*1*,--- , u,=e’*n*, we find all the other characteristic 
values of \=p” from the roots of the equation 


ui(0) — m(1) + — (1) 


where 2, is a constant not zero. The roots of this equation other than p=0 
are p=2vmi/w,, k=1,--- , nm, v=+1, +2,---. The corresponding 
values of are A= — (2y77)", +1, +2,---. 


* Tamarkin, Rendiconti del Circolo Matematico di Palermo, vol. 34 (1912), p. 359. 


u(0) — u(1) = 0, 
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Hence the Green’s function G(x, y; \) exists for this differential system 
We consider separately the two cases n=2u—1, n=2yp. 
Case I. n=2u—1. Corresponding to \=A,= —(2v7i)", G(x, y, has 
a residue R,(x, y); and the differential system a solution ernie fe +1, 
We use these facts to obtain a development of R,(x, y), considered 
as a function of y, in Fourier series. By a fundamental property of Green’s 
function we have 


0 


We divide this equation by 27i(A—X,-) and integrate both sides of the result- 
ing equality with respect to \ over a small circle surrounding the point 
A=X,. We obtain 


1 
(x, y)dy =- 
0 lo, 


If we form the corresponding equations for —v’, add them to those just 
written down and divide by two, we tind 


1 1 erriz 
f cos =4 
0 \ 0 


In the same way 


f sin 2v’ryR(x,y)dy = 
0 


Hence we see that R,(x, y) =e?”"*=-», yp=0, +1, +2,---. If now we let 
Co be a circle with center at the origin in the \-plane, including no other pole 
of G(x, y;), C: a concentric circle including also \1, A-1, and so on, we have 
to consider the expressions 


k+l 


fo f Gly, 5 


, we tr 
= +p 
9 
y=—p}. 
0 
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On writing A, =(27w,)" and denoting by [w,] the greatest integer less than 
w,, we see that the first integral reduces to 


1 a=[w,] kt 1 ok 
f fy)dy+2 (1 =~) f 60s 2ow(x — y)dy, 
0 0 ’ 


a=1 wi” 
and the second to a similar series. We recognize the Fourier series for the 
interval (0,1). 
Case Il. n=2y. For X\=\A,=—(2v7i)", v=0, 1, 2,--- , the Green’s 
function has residues R,(x, y); and the differential system has solutions 
cos 2y7x, sin 2vrx. From the identities 


1 
(A — Ay) f cos 2v’ryG(x,y ; A)dy = cos 2v’ry, 
0 


1 
(A — Ay) f sin 2y’ryG(x,y ; A)dy = sin 2v’rx, 
0 


we obtain, as in Case I, the Fourier development of R,(x, y) considered as a 
function of y. It turns out that 


Ro(x,y) = 1, R(x,y) = 2cos2ver(x — y), v=1,2,---. 


As in Case I, the expansion problems lead us directly to Fourier series for 
the interval (0, 1). 

To make clear the nature of the expressions involving factors of the form 
|1 — (a*"/w") |*+! we give references to the literature from which the following 
theorem may be proved: 


THEOREM VI. Let uo(x)-+1;(x)+ue(x)+ - - - be an infinite series whose 

terms are real functions of the real variable x, aSx<b. Then the expressions 

a=0 
where 2=w", A>0, 5=0, are the Riesz typical means of the first kind of order 
5 and type a for the infinite series in question. If limg.. Sij* exists and is 
equal to U(x), the infinite series is said to be summable (a’,5) at the point x to 
the value U(x). Summability (a, 5) implies summability (a*’, 5) to the same 
value; if the summability (a*, 6) is uniform for a certain range of values of x, 
then the summability (a, 5) is also uniform for that range. Likewise, summa- 
bility («*, 5) implies summability (a*, 6’) to the same value if 6’=6, with similar 
remarks on uniform summability. The means Sq'(x) are equivalent to the 
Cesdro means for integral 6 and their generalisations for arbitrary 5; that is to 
say, summability (a, 6) implies summability C(6) to the same value, and con- 
versely, with remarks on uniform summability like those previously made. 


z 
z 
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Riesz’s typical means are discussed in some detail in a joint work of Hardy 
and Riesz,* where further references are given.{ The series there considered 
are series of constants; but the extension of the methods employed to 
questions of uniform summability when series of functions are involved 
requires merely appropriate modification of Lemmas 4, 5, 7, 8. Theorem 
16 gives the relation between summability (a*, 5) and summability (a*, 5’) 
when 6’26. The relation between summability (a*, 6) and summability 
(a*’, 8) is a special case of the theorem stated in the second foot-note on 
page 33; a proof of this fact could be modeled on that of Theorem 17. The 
comparison of summability (a, 6) with summability C(6) follows a paper of 
Riesz.t We add the 


Corottary I. Summability (a*, 6) and summability C(6) are completely 
equivalent with respect to summability at a point and uniform summability 
over a set of points; each implies the other. 


The importance of this corollary is due to the fact that the known results 
concerning the Cesiro summability of Fourier series and its term-by-term 
derived series can be carried over to summability by the Riesz means, which 
are better suited to the form of Fourier series which we shall discuss under 
Theorem V. We shall then be able to carry these results over to the general 
Birkhoff series by the intervention of this corollary and Theorem V. 


IV. BIRKHOFF SERIES FOR n=2yu—1 


Distinct differences between the case when n is odd and the case when 
n is even make it necessary to consider each case separately. The methods 
employed in the two cases are virtually the same; we therefore give a com- 
plete study of the case »=2y—1 and then describe the necessary changes 
in mode of attack and in statement of theorems in the case n =2y. 

By our hypotheses, the differential system considered is 


+ + pou?) +---+ (pr, = 0, 
Wi(u) = 0,---, Wi(u) = 0, n= 


* Hardy and Marcel Riesz, The General Theory of Dirichlet’s Series (Cambridge Tracts, No. 18, 
1915), Chapters IV and V. 

7 See especially Marcel Riesz, Sur les séries de Dirichlet et les séries entiéres, Comptes Rendus, 
vol. 149 (1909), pp. 909-912. 

t Marcel Riesz, Une méthode de sommation équivalente a la méthode des moyens arithmétiques, 
Comptes Rendus, vol. 152 (1911), pp. 1651-1654. 
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where 
W (u) = + Bw 
ki kite, 

and ai, Qn, Bi, Bn, ki, , kn are subjected to certain conditions 


in accordance with the definition of regularity.* On making the substitution 
=p" we are led to consider instead of the entire \-plane a corresponding 
sector of the p-plane, which we shall take as composed of two adjacent 
sectors S: R(pw:) = R(pwe)S - -- SR(pw,), arising, of course, from distinct 
orderings of the constants w; this region we call ©. The sector S is a region 
T of §I for c=0. We recall now the facts concerning the asymptotic dis- 
tribution of the values of p giving rise to the characteristic values of X. 
On an arbitrary sector S these values of p lie asymptotically near points 
equally spaced at distance 27 along a line parallel to the bisecting ray of 
S; there is one and only one value of p near each of these points when |p| 
is large; and the corresponding values of \ are simple characteristic values 
of the differential system. f 

We wish to compare any two given differential systems of order n=2y—1. 
From the p-plane we remove all the points which give characteristic values 
of one system or the other of the two which we are comparing; we do this 
by removing the interiors of circles o of arbitrarily small radius ¢e, one 
described about each such point as center. The portions of S, = which 
remain we denote by S’, =’. We then denote by I any circular arc on 2’ 
with center at the origin terminated by the rectilinear portions of the 
boundary of =’. The image of I in the )-plane is a circle C; the totality of 
such circles C forms an infinite set of concentric annular regions, none of 
which contains a characteristic value. From the behavior of the characteristic 
values for large |A| we see that the large circles of the set Ci, C2, C3, - - 
as described in §II can be selected as circles C; this is true simultaneously 
for the two differential systems we are considering. 

The region S is such that on itf 


R(pw:) S R(pwe) S --- S R(pw-1) — B< 0; 
R(pwun) = 0 on the bisecting ray of S; 
R(pw,) R(pwy+1) 28> 0. 


* Birkhoff, these Transactions, vol. 9 (1908), p. 383. 
1 Birkhoff, these Transactions, vol. 9 (1908), pp. 383-386. 
} Birkhoff, these Transactions, vol. 9 (1908), p. 384. 


714 M. H. STONE [October 


We therefore denote by S, that part of S for which R(pw,) £0; the part for 
which R(pw,) 20 by S:. The notations S,’, S:’ can be introduced according 
to the principle previously employed. The portions of T on S’, S,’, S2’, 
we shall call y, 71, y2 respectively. The common radius of these arcs we 
denote by R. 

We now prove several preparatory lemmas. 


Lemma III. Jf m (x, y, p) is a function of x, y, and p bounded foraSx <b, 
asy3b, and for all p on S’ such that 0’S arg p<0"'; and tf a is any arc of 
+ between 6’ and 0’, then the integral f{.m(x, y, p) dp/p is a bounded function 
of x, y,asysb, and R. 


We have 
dp 
if m—\|s uf do = M(6” — 
a p a 


Lemma IV. If m/(x, p) is a function of x and p bounded for aXx<b, 
0<a<b<1, and for all p on S’ such that 0’ = arg p<0"’, then 


lim | p*e'*m(x,p)dp = 0 (i --+,p—1), 


lim | = 0 (i 


Rox a 
uniformly, 0<aSx<b<1, for any positive k. 
We let 6 be the lesser of a, 1—d. Then 
| 
f p*eizmdp | UR = — 6’) 
Ja @ 


when i=1, - - -, w—1; and 


f ndp| < 6’) —, 0 


when i=y+1,---, 2. 


Lemma V. If m(x, p) is a function of x and p bounded for 0<asx<b<i, 
and for all p on Sj, then 


k+l 


for any k=0, (1) is uniformly bounded for all a on y; and all x on (a, b) if 
l=0,and (2) approaches zero uniformly as R>~ if l>0. If m(x,p) is a function 


= 
= 
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of x and p bounde for x on (a, b) and for all p on S,' similar statements are true 


of the integral 
p*{1—— (x p)dp. 
Ri" 


In the case of the first integral we introduce an angle ¢ measured from 
the bisecting ray of S as initial direction, with the positive sense in S}. 
We then see that we can write 


pw, = 
0 <= ¢/2 sing, 
0 < | 1 — [etl < 


We find 


| p*” k+l 
p* (1 -- 
Rin 


if R**1(1 — Resing gikxcos om dh 
lJ 


/ 


< uf | 1— einie < un 
0 0 


k+l 
gktle 


This establishes the first result. The second integral can be treated in pre- 
cisely similar fashion. 


Lemma VI. If m(x, y, p) is bounded OS x51, for all p on S, 


then 
m(x,¥,p) 
f f evi (z—u) m(x, dpdy 
a p 


approaches zero uniformly, andi=p+l,-: - 


O0<x<a<1; and 
z m(x,¥, 
f f ere P) dpdy 
a p 


approaches zero uniformly, i=1,---,u—1, 0SaSxS1, and i=yp, -- - 


715 
2n 
= = 
2n 
| 
{ 
MN MN 
Jo Jo 
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We have when 7=1,- - -.,u—1,0Sasx<l, 


m 
f f erwi(z—y) <= f f Mdody 
71 p @ 


1 — aM 
< ~ 


Similar reasoning applies when 7=y+1,---,”,O0Sx%Sa<1. The integrals 
over 2, 1~p, are discussed in the same way. 
When i =» the treatment is slightly different. ForO<a<x<1 


s m z 
J f dpdy| f Mdgdy 


dg 


z= 4 — 
uf dy 
a R(x y) R a@ 


M M 
— = O (log R) 0. 


RJo 
By analogous means we obtain the corresponding result for the integral 


over 
We can now demonstrate 


TueoreM VII. If G and G are the Green’s functions for any two regular 
differential systems of order n=2y—1, then for any region S’ the integral 


nl 
; p") — G(x, y ; fdp 


is bounded for all y on S’, OS 1, forl=0. 
We first recall the explicit form of the Green’s function, which we know 


to exist in the present case. We have 
u(x) + Un(x) g(x,y) 


( — 1)" W,(m) - 
G(x,y ; 
1(1t1) + + W if Un ) 


W,, (1) W,( Un ) 


= 
= 
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where ™,--- , “, are linearly independent solutions of the differential 
equation and 


1 i=n 1 i=n 
i=1 i=1 
the notation {A; B} indicating that A is to be taken if x>y, B if xy, 
and the functions 1, - - - , v, being defined by the identity in 71, --- , tT. 


u(y) + + tta(y) 


+ tataly) = 


We shall take ™, - - - , u, as the functions defined in Theorem III for the 
sector S. It is then possible to determine the asymptotic form of the Green’s 
function on S,’and S,’ in sufficient detail that the present theorem is apparent 
from the lemmas just established. 

Case I. ponS,;’. We multiply the first 1 columns in the numerator of 
G by - - - , respectively, the next columns by —}v,4:(y), 


- , —4v,(y) and add to the last. We then take the first term in the new 
last column outside the determinant. The result of substituting the asymp 
totic forms of - , %1, , in the expression thus obtained is* 


i=u 


i=n Ai 
np"*'G = { epwi(z—y) w; | 
where A; is a determinant of order n+1, in which the element a,;, h=0, 
-++,m,j=1,---,n+1, may be described as follows: 


= 0; 
and fork=1,---,n” 


Ong = = 1, 15 = + ; 
i=u 


i=1 


i=n 
t=—p+1 


* Birkhoff, these Transactions, vol. 9 (1908) pp. 389-395. 
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The notation [ | is used to indicate the leading term of the asymptotic 
form in question. The exponential terms in the determinant A, are bounded 
for all x and y on (0, 1) and for all p on S,; and 1/ {[@0]-+e**{6,]}, in which 
6) 0,+#0 by the regularity of the boundary conditions, is bounded for all 
p on S;’.* Hence we are able to write 


4n\ l i=u i=n 
i=1 


mi(x,¥, mi(x,Vv,p)) 


i=1 p i=p+1 p 


+ i(x,y,p) + M (x, 


i=1 i=y+1 


where # and M denote functions bounded for all x and y on (0, 1) and for 
allp on S,’. Thus, by Lemmas III, IV, V we see that 


ini l u n 
R** i=] i=u+l1 


is bounded OS y<1, 0<aSx<b<1, for all y; on S,’. The application of 

Lemma III, it should be observed, is made to the bracket term on the right, 

which is clearly of the form m/p for all « and y on (0, 1) and all p on S;. 
Of course, in the result just obtained we may replace G by G; a combina- 


tionof the facts thus established shows that the integral 


p** l 
f (1 --) G —G)dp 
y Rin 


1 
is bounded OS y<1, for all y; on S;’, 1/20. 

Case II. pon S;’. We now multiply the first ~—1 columns in the nu- 
merator determinant of G by 3u(y),--- , 40,-1(y) respectively, the next 
uw columns by —4v,(y), - - -, —30n(y), and add to the last. We then continue 
as in Case I, finding 


i=u-1 


i=n Ao 
where A, differs from A; in Case I in having the definition of a,, replaced 
by 


i=1 


doy = any = + h=1,---, m, 


* Birkhofl, Rendiconti del Circolo Matematico di Palermo, vol. 36 (1913), pp. 120-121. 


p 
R 
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and in having the sums in the definition of a4 .n4:, #>0, extended from i=1 
to i=yu—1 and from i=y to i=m respectively. The exponential terms in 
the determinant A, are bounded for all x and y on (0, 1) and for all p on 
S»'; and 1/{[@o]e-¢*«+[6,|} is bounded for all p on S,’. The reasoning now 
proceeds asin Case J. Thus we see that the integral 


pi” l 
(G — G)dp 
J. Rin 


is bounded O< y<1, 0<asx<b <1, for all y2 on S,’, 1/20. 

By combining the facts in Cases I and II we obtain the theorem as stated 
for the integral over y on S’. Since S’ is arbitrary the result holds also for 
the arc T' on 2’ and can be interpreted in terms of the circles C in the \-plane. 

We next demonstrate 


THEOREM VIII. Under the hypotheses of Theorem VII, the integral 


8 l 
f f G(x,y ; p") — G(x, »"))dpdy, 
a 


where 120 and has the limit zero as uniformly 0<asx 
<b<i. 


We split the integral over (a, 8) into the sum of the integrals over (a, x) 
and (x, 8); since the last two integrals are of the same form we consider but 
one of them. 


Case I. pony:. We employ the asymptotic form of obtained in 
the preceding theorem. In the determinant A, the variable y occurs only 
in the last column so that the integration with respect to y can be performed 
directly in the determinant. We see at once that 


a m — epwi(l—a) 
f + Nay = 
z p pw; 


m m(x,0,p) 
f ervil—v) — dy 
p 


for all x and a on (0, 1) and for p on y;, i=1, - - - , w; and that 


m 
z p 


for all x and @ on (0, 1) and for p on y1, i=u+1, - - - , a. Hence we see that 


m(x,a,p) 
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i=n 


a 4nv l i 
z 71 


= f f { eret(s—9) (x, p) s+ y) (x, pay 
z v1 p 


p 


1 i=u i=n 
i=1 i=y+1 


The two expressions on the right of this equation approach zero uniformly 
as Roo , 0<asx<b<1, by Lemmas IV, V, VI. 

We can replace G by G in this result; and we may write 6 instead of a. 
Thus it follows that 


Jim 22) 5 0”) Gary = 0 
a v1 


uniformly 0<a<x<b <1. 

Case II. pon yz. We must use here the asymptotic form of np*"'G 
employed in Case II of the preceding theorem. We separate the integral 
over (a, 8) into two parts as in Case I, and then perform the integration 
with respect to yin the last column of the determinant A,. On expanding the 
determinant we have 


a l i=u—1 


p p 


1 i=u-1 i 
R t=1 

By Lemmas IV, V, VI, the expressions on the right approach zero uniformly 
as 0<as<x<b<1. Thus, asin Case I, 


l 
jim (1 (G — G)dpdy 


uniformly, 0<asx<b<1. 

From Cases I and II we conclude the truth of the present theorem. By 
the arbitrariness of the sector S’ the theorem remains true if the integration 
with respect to p takes place over I. 


Ej 
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From the two theorems just established there follows 


THEeoreM IX. [f f(x) is any summable function defined on (0, 1) and if 
l=0, then 


R 


1 nl 
lim frof (G(x,y ; p") — G(x,¥; p"))dpdy = 0, 
0 


1 l 
lim frof (1 (G(x,y ; — G(x,y; d))dAdy = 0, 
Je c A‘ 
uniformly, 0<asx<b<l. 


We see at once that Theorems VII and VIII together give us the sufficient 
conditions of Theorem IV and therefore establish the present theorem with 
regard to the first integral. Since the sector S’ is arbitrary the arc y may 
be replaced by I in the first integral; and then a return to the variable 
gives the second integral. 

Theorem IX clearly yields a large amount of information concerning the 
behavior of the expansions arising from two distinct regular differential 
systems of order n=2u—1. There is one point, however, which we desire 
to examine before stating our final theorems on the comparison of these 
expansions. If we let Na, Na be the number of residues of the two Green’s 
functions G and G respectively arising from poles in the circle C of radius 
A, we clearly have |Na—WNa| << independent of A, though it is not necessary 
that Na=Na. Thus the expansions determined by the integrals over C 
are not compared term-by-term in Theorem IX in general. In order to 
compare the sums given by the first NV residues in the two cases we clearly 
need to obtain a generalization of the theorem of Riemann-Lebesgue, familiar 
in the theory of Fourier series: we need to show that the sum of any limited 
number of terms of the expansion becomes arbitrarily small when each term 
of the sum arises from a pole of G sufficiently removed from the origin. This 
is the aim of the theorems which succeed. 


THEOREM X. The residues of the Green’s function for a regular differential 
system of order n=2u—1 are a set of functions of x and y uniformly bounded 
for all x and y on (0, 1). 


On the region S’, np"-'G(x, y; p") is uniformly bounded for all x and y 
on (0, 1), as we see from the asymptotic forms discussed in Theorem VII. 
Now we can compute any desired residue of the Green’s function by evaluat- 
ing the integral 
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1 
R(x,y) = — f np™"G(x,y ; p*)dp 


taken over the circumference of the circle o of radius e described about the 
corresponding pole of G in the region 2; the circles o remote from the origin 
lie entirely interior to a region S. Thus we find |R,(x, y)|<Me for all x 
and y on (0, 1), v=1,2,---. 


THEOREM XI. Under the hypotheses of Theorem X, the integral 


1 
f f np""G(x,y ; p")\dpdy = f R(x, y)dy 
2rida Je a 


approaches zero uniformly, 0<x<1, as the center of the circle o recedes in- 
definitely from the origin. We suppose O0Sa<fP<1. 


As in Theorem VIII we divide the integral over (a, 8) into integrals over 
(a, x) and (x, 8), of which we consider only one in detail. We then show that 


on S’ we have 
m(x,a,p) 
f np"'G(x,y ; p")dy = 
z p 


f np""'G(x,y p")dy| S 


where R’ is the shortest distance from the origin to the circle c. 

Case I. p on S,’. We refer once again to the asymptotic form of 
np"'G(x, y; ep") givenin Case lof Theorem VII. We perform the integration 
with respect to y directly in the determinant A, as in Theorem VIII and find 
that the contribution of this term is of the form m(x, a, p)/p, just as before. 
It remains to treat the bracket terms in the asymptotic form. We have 


a 1 epwi(z—a) ms; MAX,a 
f epwi(z—y) (« dy +f epwilz—y) dy = mi(x,a,p) 
p p 


It follows that 


p p 
for p on S;’, OSaSx51,i=1,---,p; and 
mi m(x,a,p 
f epwi(z—y) (« dy = op) 
z p p 
for p on S;’, OSxSa<1,i=y+1,---,m. Hence, the bracket terms also 


contribute an expression of the form m(x, a, p)/p, p on S;’, for all x and a on 
(0, 1). Consequently on S,’ we have 


f ; p")dy = 


m(x ,a,p) 
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Case II. pon S,’. The procedure is the same as in Case I except for the 
difference of the two sectors with regard to the constant w,. 

As we have already pointed out, the theorem flows at once from the results 
of Cases I and II. 

We now obtain the generalised theorem of Riemann-Lebesgue: 


THEOREM XII. If the hypotheses of Theorem X are fulfilled, and if 
f(x) is any summable function on (0, 1), then 


1 1 
f np" G(x, ; p")dpdy 


approaches zero uniformly, 0<x <1, as the center of the circle o recedes in- 
definitely from the origin. In other words, if R,(x, y) is the residue of G(x, y;) 
corresponding to the pole X= ,, then 


lim f(y) R(x, y)dy = 0 


uniformly, 0<x 1. 


The theorem is established at once by comparing Theorems IV, X, XI. 
We can now make the following fundamental assertion :* 


THEOREM XIII. On any closed interval (a, b) completely interior to 
(0, 1) the term-by-term difference of the expansions formed for any summable 
function f(x) in connection with two regular differential systems of order 
n=2u—1 converges uniformly to zero. These series we shall call Birkhoff 
series of order n=2u—1; and we may restate the preceding result, saying that 
any two Birkhoff series of order n =2p—1 are equivalent on (a,b). In particular, 
Birkhoff series of order n=2p—1 are equivalent to Fourier series on (a, b). 


The material on which the proof rests is to be found in Theorems V, 
IX, XII. If we take ]=0 in Theorem IX we find that 


1 1 
= f G(x,y — G(x,y ; 


gives the difference of Na terms of one series and Na of the other, and 
approaches zero uniformly on (a,b) as A>. By Theorem XII we can re- 


* Tamarkin, D. E., Chapter V, has shown the truth of the last statement here, without actually 
discussing the term-by-term difference of the two series; his statement of the result is found in 
Lemma 6, where we have p=1, =1/n, g=2z, =1, 


roe 0 
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move a limited number of terms from each series without affecting the uniform 
convergence of their difference to zero, provided that the terms removed 
correspond to poles of the Green’s functions which recede indefinitely from 
the origin. Because of the regular distribution of the poles of the two Green’s 
functions, we can always do this in such wise that we obtain the term-by- 
term difference of the first NV terms of the two series, which therefore con- 
verges uniformly to zero on (a,b) as N becomes infinite, passing through all 
positive integral values. Theorem V tells us that the Fourier series is a 
special case of the Birkhoff series of order »=2u—1, and therefore justifies 
the last statement of the theorem. 

The consequences of this theorem are so important that it is worthwhile 
to describe them in detail. The theorem means essentially that on any 
interval (a, b) interior to (0, 1) any two Birkhoff series of order n»=2y—1 
have exactly the same behavior and both behave like Fourier series. In 
particular, properties of Fourier series such as uniform and non-uniform 
convergence, divergence, oscillation, uniform and non-uniform summability, 
are carried over directly to Birkhoff series. The various tests for the conver- 
gence and summability of Fourier series apply also to Birkhoff series. 
We conclude the existence of continuous functions whose Birkhofi series 
converge non-uniformly or diverge. Gibbs’ phenomenon is common to 
Fourier series and Birkhoff series. The theorems of Fejér-Chapman and 
Lebesgue-Hardy are true for Birkhoff series. Another important fact 
which we may observe is that the behavior of Birkhofi series at any interior 
point of (0, 1) depends only upon the nature of the expanded function in the 
neighborhood of that point. 

It is also in place to note that, for the class of all summable functions, 
this result is the strongest that can be obtained. The equivalence cannot 
be extended to the entire interval (0, 1) as can be shown by an example.* 

In the preceding paragraphs we mentioned the theorems of Fejér- 
Chapman and Lebesgue-Hardy, which are concerned with the Cesaro 
summability of Fourier series. It is not without interest to apply our methods 
to prove these theorems. To accomplish this we need a lemma whose truth 
is suggested by facts from the theory of Bessel functions. 


Lemma VII. The function 


+1 


1 
= (1 — = 2f (1 — 
1 0 


* Stone, these Transactions, vol. 26 (1924), pp. 335-355, § II. 
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satisfies the inequalities 
| (a) | <K, 0Sa1; |#(e)| a21, 


where K is a suitably chosen positive constant. 


The first inequality is obvious; the second we demonstrate by contour 
integration in the complex ¢-plane. We may describe the contour as the 
boundary of the simply-connected closed region obtained from the rectangle 
with vertices —1, +1, +1-—oai, —1—ai,a>0, by the introduction of a cut 
along the negative axis of imaginaries from —i to —oi. In this region the 
function (1—¢*)'e~‘*¢ is analytic, except at —1, —7z, +1, and continuous; 
at the origin it has the value 1. By Cauchy’s theorem we have 


—1 +1 l—oi —ot 
+ f ya = 0. 


On writing ¢=1—it/a we find 


a 


1-oi 
f (1 
1 


IIA 


+ 6¢ + + #)'e-‘dt 
Jo 


fora=1i,o>1. Similarly, if we set ¢=/—o7 we obtain 


1 
f 
0 


+ (1 0 


| (1 | 


l—oi 


IIA 


as a0, fora2=1. Likewise by the substitution ¢= —i—it/a we see that 


J 


a 


+ 6t + 40 + #)'e-tdt < Ka“*1/4, 
0 


iA 


fora21i,a0>1. Hence, on allowing o to become infinite in these three terms 
and on treating the last three in the same manner, we find 


| (a)| S$ 


as we were to show. 
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We can now establish 

THEOREM XIV. The Fourier series for an arbitrary summable function 
is summable (C, 1), 1>0, to that function almost everywhere.* If the function is 
continuous, then the summability is uniform, 0<asx<b<1.t 

Taking the differential system u’+du=0, u(0)—u(1) =0 we are to show 
that if f(x) is summable 

1 x4 
tim ff = fo) 
Cc. 


pow 25% 


almost everywhere on (0, 1). We adopt the notations 


F(a) = fe + 2) + f( 2+ a) = f | 
0 
We choose any value of x for which 
lim &(z)/z = 0, O<x< 1. 


By a theorem of Lebesgue these values of x form a set of measure one.} 
Theorems IV, VII, VIII show us that the only significant contributions of 
the — are 


+> to; | drdy 


/2 


3x /2 


where we have recalled se fact that \=A,e” on y. Henceforth we shall let 
A be a continuous variable. In the first integral we set ¢= —ie’*, 2=x—-y, 
and in the second ¢ = —ie”, z=y—x. There results 


f( — 2+ x)A®(Az)dz +7 Ie + x)A&(Az)dz. 


* Hardy, Proceedings of the London Mathematical Society, ser. 2, vol. 12 (1912), pp. 
365-376. 

¢ Chapman, Proceedings of the London Mathematical Society, ser. 2, vol. 9 (1911), 
pp. 369-409. 

t Lebesgue, Lecons sur les Séries Trigonométriques, Paris, 1905, §50, p. 96. 
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If we take A Sx, 1—x we have as A> 


< f | f(-—z+2)|dz-0, 
A 


f — + 


f(z + x)A®(Az)dz 


1-z 
< f | f(z + x)| dz 0. 
A 


A 


Furthermore, we see that 


1 1 £4 17 sin AA@ 
A®(Az)dz = -f f (1 — ¢$*)'A cos Azodzdd = (1-99) do. 
2m, 0 TJ 0 J0 T/0 


The last integral, essentially the well known Dirichlet integral, has the 
limit 3. To prove our theorer:, then, it is sufficient to show that 


A 
F(z)A®(Az)dz = 0. 
We can establish this by following Hardy’s discussion of a precisely similar 
integral. In his treatment, the essential properties of the function corres- 
ponding to our function @ are those which we enumerated in Lemma VII. 
This theorem was demonstrated by Lebesgue in the case /=1, and by Hardy 
in the more general form. It is not difficult to see that when 0<aSx%5)<1 
and f(x) is continuous, the steps of the proof can be carried through in the 
same manner, with the additional result that the convergence is uniform in 
x because of the fact that 
lim $(z)/z = 0 uniformly, 0<aSx<56<1. 
20 

This result, due to Fejér for /=1, was established in the general case by 
Chapman. 

In our preceding theorems we have compared series only on an interval 
(a, 6) completely interior to the interval (0, 1) on which they are defined. 
If we wish to consider the entire interval (0, 1) we find it possible to do so 
when we compare the series arising from differential systems sufficiently 
alike. It is sufficient to restrict the boundary conditions so that ai1=a, - - -, 
On = Bi=Bi, Bu=Bn, ki=hi, - - - , kn=kn, as we shall see presently. 
We first introduce new lemmas which serve as useful tools in studying the 
questions thus raised. 


Lemma VIII. On any region S’ 


[00] + er = [1] if 6. = 5,0: 


[G0] + 


| 
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On S,’ we see that 
([00] + — + = [0] + [0], 
whence 
[00] + _ [0] 
[80] + [50] + [81] 
Similarly, on S,’, we find 
+ + [01] = [1]. 
[Bo] + [Bolero + [01] 
Lemma IX. The integrals 


= [1]. 


me, y,p) 


nix ) 


n), 


m(x,V,p) 


m(x,y, 


n), 


where |m(x, ¥, p)|<M for all x and y on (0, 1) and for all p of sufficiently 
large absolute value on S;’ or S2', as the case may be, converge uniformly to zero 


as Roo, 


Leaving aside for the moment the integrals for which i=y, we may take 


as typical the following procedure: 


f f e dydp |= fe wWdy 
p a 
aM 
= ——— | — | < —_ 
2BnR BnR 


In the two integrals where i =u, we operate as follows: 


f f 


z 1 am (1—z)R 1 
R(i — y) (1-a) R 


<M 


m 
dp 


R1i— 
0 


The second integral with i=y is discussed in an analogous manner. 


—0. 


J 
as| 
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We can now prove 


THEOREM XV. If Gand Gare the Green’s functions for any two regular 
differential systems of order n=2n—1 for which Bi 
++, Ba=Bn, ki=hi, kn=kn, then for any region S' the integral 


f ;p")— G(x,y ; p"))dp 
4 


is bounded for all y and for all x and y on (0, 1). 


We must consider separately the integrals over 7: and 72. 

Case I. pon S;’. We refer to the asymptotic forms of mp"-' G and 
given in Theorem VII. If in the form for mp*-!G we expand the deter- 
minant A, we are led to a sum of terms each of which is the product of a 
certain number of exponential functions, bounded for all p on S; and for 
all x and y on (0, 1), multiplied by an expression of the type [A]/ {[60] 
+e**[6,]}. In p"-'G the term containing the corresponding exponentials 
has the coefficient [A]/ where by hypothesis A=A, 
6,=6;. If we subtract the corresponding terms, the resulting coefficient 
becomes, by Lemma VIII, 


[4] 


[40] + [G0] + [0] + 
for all p on S,’ and all x and y on (0, 1). Now each product of exponentials 
from the determinants A,, A, contains exactly one of the functions 


If we group the terms in the difference mp*-(G — G) according to the exponen- 
tials in y which they contain, we see that we can write 


— G) =< — pw i (x—y) x,y ,y,p) 
— G) e s+ 
t=1 p p 
> epwi(l—y) Mnsi( P) +4 X »P) 
t=1 p i=p+1 p 
m(x,y,p) 
p 


where the functions m are bounded for all p on S;’ and all x and y on (0, 1). 
The application of Lemma III to the last expression proves that 


f y; p") — G(x, y ;0"))dp 


is bounded for all +; on S,’ and for all x and y on (0, 1). 
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Case II. ponS/. We now find for the difference mp*-!(G — G) an expres- 
sion like that of Case I, different only in having the sums extended from 
i=1 to i=y—1 and from i=y and to i=m respectively. The desired result 
for the integral over ‘72 is then established at once; and the theorem follows 
at once. 


THEOREM XVI. Under the hypotheses wf Theorem XV, 
im ; p") — ; p"))dpdy = 0 
7 


uniformly for all x on (0, 1). 


We take the integral over (a, 8) as the sum of integrals over (a, x) and 
(x, 8). Of these we consider but one. 

Case I. pon S;’. From the expressions obtained in Theorem XV we 
see that 


f np"-\(G — G)dpdy 


p t=y+1 


a 


i=1 p 


uniformly, 0<x<1, 0<a<1, as R->«, by Lemmas VI and IX. 

Case II. p on S2’. We use the work of Theorem XV and Lemmas VI 
and IX exactly as in the preceding case. The double integral over y and 
(a, 8) therefore behaves as described. 


THEOREM XVII. Under the hypotheses of Theorem XV, 


im f(y) f np"-(G(x,y ; p*) — G(x,y ; p*))dedy = 0, 


im f G(x,y ; d) — Glx,y ; = 0 


uniformly for all x on (0, 1), for any summable function f(x). 


We combine Theorems IV, XV, XVI to obtain the limit of the first 
integral. If we take arcs 7 on two adjacent sectors S’ and transfer the results 
obtained for the integrals over them to the A—plane, we find the second limit. 
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As Theorems IX and XII resulted in the first part of Theorem XIII, 
so Theorems XII and XVII lead to 


THEOREM XVIII. The term-by-term difference of two Birkhoff series of 
order n=2y—1 for any summable function f(x) converges uniformly to zero, 
O<x<1, provided that the boundary conditions of the two differential systems 
defining the series can be reduced to normal forms for which a=a,--- , 
On =Oin, Bi=hi,-- +, Bn=Bn, ki=hi,- +--+, Rkn=kn. Such Birkhoff series are 
equivalent on (0,1). The differential system 


u™ + ru = 0, n= 2y—1, 
+ ButP(1)=0, > ki, 


anu*n)(0) + = 0, 


where a, B, k satisfy the conditions for regularity, can therefore be regarded 
as typical of regular differential systems of order n=2y—1 in the consideration 
of expansion problems with respect to the class of all summable functions. 


This theorem is the generalization to the case n»=2u—1 of the leading 
results of Haar and Walsh mentioned in the introductory paragraph. We 
shall apply it to the discussion of Birkhoff series at x =0 and x =1. 

We shall introduce two determinants D,(x, y, p) and De(x, y, p) which can 
be obtained from the two determinants A; and A, described under Theorem 
VII by replacing each term of the form [A] by the dominant constant A, 
and deleting the terms in e*“«. We can then prove an interesting inter- 
mediate theorem. 


THEOREM XIX. If f(x) is any summable function on (0, 1) and if G 
is the Green’s function for a regular differential system of order n=2y—1, then 


1 i=u 
lim (mp""G(x,y ; p”) — { = 
0 i=1 


R t=p+1 


1 
9, 920) )dody 0, 
0 


1 i=n 
tim f 469) f 5 -{- 
0 72 t=1 


Ro 


1 
D2(x,y,p))dpdy = 0, 
1 


uniformly, 0<x 31. 
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By Theorem XVIII, it is sufficient to consider the typical differential 
system there described. 

First integral. On substituting u;=e°***, in the 
explicit formula for the Green’s function, we obtain for p on S;’ the asymp- 
totic form given in Theorem VII, where now any expression [A] can be 
written as the sum of the constant A and a number of exponential terms 
formed by taking products of - - - , e?*-1, , ern, The ex- 
pressions [A] in the bracket terms and in the first row of the determinant 
A, reduce to the constant A. Since we can put 

= | | | eR (ewi)—R (rey) 1 (i=1,---,u-—1), 
1 (i=yn+1,---, 


for all p on S,’,we find [A] =A-+e°*m(p). If now we expand the determinant 
A, we see that 


= Pep | | | = (pwid—R (pep) 


i=n 


= Dilx,y,p) + em m(x,p) (x, p). 


i=l t=p+1 
Slight manipulation shows that 


1 1 1 
M(p) 


0 


for all p on S;’. Therefore we find 
' i=n 1 
i=l 


i=yu+1 0 


= coal > eres M; (x,p) + > M ; (x, p) )- (x,y, 2) 
i=1 i=u+1 

where the functions M are uniformly bounded for all x and y on (0, 1) and 

for all p on S,’. The first integral can now be brought under Theorem IV. 

For 


<M f = 2M(1 — < 2M, 
0 


f (x, y,p)dp 


while 
8 i=u i=n 
ffem( M; (x,p) + (xp) 
a i=l i=p+1 
= f eon > M; + 
i=1 i=p+1 p 


i 


0 


The theorem is therefore established for the first integral. 
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Second integral. By proceeding as in the case of the first integral, we 
show that the integrand in the second has entirely similar properties, due 
account being taken of the behavior of pw,on S;’. The theorem is then easily 
proved for this integral. 

In these facts, then, we see the expansion problem set by Birkhoff stripped 
of all asymptotic encumbrances and expressed explicitly as a problem involv- 
ing only the elementary functions in linear combination. 

On setting x =0, x=1, in the integrals of the preceding theorem we can 
effect still further reductions. We denote by D,°, D,', D2°, D2' determinants 
obtained from D,(0, y, p), Di(1, y, p), p), Da(1, y, p) respectively 
by appropriate modifications of the first rows; more exactly, in the last 
four determinants those terms of the first rows explicitly involving the 
variable p as an exponential are replaced by zeros. 


THEOREM XX. The limits described in Theorem XIX are still valid 
when x is set equal to 0 or 1 and the determinants D are replaced by the corres- 
ponding determinants D. For example, 


1 i=n 1 
lim f f (mp""G(0,y 5p") — — = 0. 


By Theorem XIX it is sufficient to show that 


1 
lim f (Dut, y,0) — = 0 

for i=1, 2 and k=0, 1. Since we have, by reasoning like that used in the 
proof of Theorem XIX, 


Dilk,y,p) — Di*(y,p) = 


where the functions m are uniformly bounded, 0<y1, for p on S;, k=0, 
1; and since we can obtain similar expressions for the difference D,(k, y, p) 
—Dz;'(y, p) for k=0, 1, we are able to complete the proof exactly as in 
Theorem XIX. 

To apply Theorem XX to the study of the convergence of Birkhoff series 
at x=0 and x=1 we demonstrate the three lemmas which follow. These 
lemmas are generalisations of standard theorems concerning the Dirichlet 
integral. 


Lemma X. If f(x) is summable on (0, 1) and if =e, and \ =e", where 
6, and 62 are two distinct constants on the range (—47, 37), then 


1 
lim f(y) dy=0, e>0O. 
7 


€ 
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Since the first factor of the integrand, f(y)/y, is summable on (e, 1) and 
since the second, e~**» —e-*”, satisfies the sufficient conditions of Theorem 
IV, as may easily be shown, the lemma is apparent. 


Lemma XI. If $(x) is summable on (0, 1) and of bounded variation on 
(0, €), then 


1 — 
lim | $(y) dy = $( + 0)(62 — 
We show that the function (e~**—e-*”)/y has two properties which 
are the essential elements in well known proofs of this lemma in the case of 
the Dirichlet integral. 
In the first place, we show that 


lim dy = (62 — 0,)i. 
y 


Row 


In the plane of the complex variable ¢ we determine a contour consisting 
of the rectilinear segments and (rA2, and two circular arcs 
c and C of radii r and R respectively described about the origin and lying on 
the right half-plane. Then 


1 dy Rs 
lim — — = lim lim (f —dt— f 
Row Ra r+0 rh, t rh, t 


By using the contour described and applying Cauchy’s theorem, we can 
replace the last limit by 
lim | —— dt — lim — dt = (02 — 6,)i — lim — dt = (02 — 4)i. 


The second property needed is that the integral 


b Rhy — 
dy 


is a uniformly bounded function of R for all a and 6 on (0, 1), no matter how 
a and b may vary with R. Clearly it is sufficient to show this for the special 
case On writing B= Rb, we find 


f dy = (2 ff 
0 ot 


whence the result is apparent. 
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It is now a simple matter to demonstrate the lemma.* 


Lemma XII. If ¢(x) is summable on (0, 1) and if 


1 z 
= $(x)dzx 


is of bounded variation on (0, €), then 


— 


lim | $(y) dy = &( + 0)(02 — 


Rw 


The proof can be carried out along the lines indicated by de la Vallée 
Poussin in a paper in which he established this result for the Dirichlet inte- 
gral.f As he points out, this lemma includes Lemma XII as a special case. 


We next prove 


THEOREM XXI. If f(x) ts summable on (0, 1), then the convergence at 
x=0 or at x=1 of the Birkhoff series of order n=2u—1 for this function is 
independent of the nature of f(x) outside arbitrarily small neighborhoods of the 
points x=0 and x=1. 


By Theorem XX the convergence of the series in question depends upon 
the behavior of certain integrals which we can write out explicitly and reduce 
to the type discussed in Lemmas X-XII. These integrals arise from the 
last columns of the determinants D. We use Ai, dz, 61, 42 to denote any con- 
stants satisfying the relations laid down in Lemma X. Then, by fundamental 
properties of pw, - - - , pw, on S;,S2, we find that the integrals in question can 
be put in the forms 


1 1 
frof werk -Wdpdy = fsa of 

0 n 0 n 

- J 
0 y 
where 6, — 0, = — 1/2n, k=l, B; 
ome Ry 


* Hobson, The Theory of Functions of a Real Variable, Cambridge, 1907, ed. 1, §450. 
t de la Vallée Poussin, Rendiconti del Circolo Matematico di Palermo, vol. 31 (1911), 
pp. 296-299. 
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where 6. — 0, = +7/2n,k =y+1, 


— 
f f) J I-Wdpdy = fr 


where 6, — 0; = — =1,---,u—1; 
1 1 e — 
f 10) = ff) ———ay 
0 0 y 
where 6.—0,;= +7/2n,k=y, - --,n. It isimmediately evident from Lemma 


X that the behavior of these integrals, and therefore of the series for f(x) 
at x=0 or at x=1, is independent of the nature of f(x) except near these 


points. 
The use of the integrals computed in the preceding paragraph and of 
Lemma XII enables us to state 


THEOREM XXII. If o(x) is summable on (0, 1), and if 
1 
XJ 0 
are of bounded variation near x =0, then 
1 1 
f np""'G(k,y ; p")dpdy = Aix Pi( + 0) + Bir + 0) 
vi 


where A, and B,, are constants depending only upon the boundary conditions, 
fori=1,2, k=0,1. For example, 


AioPi( + 0) + BioP2( + 0)= + 0) 


0 0 
i=n 
1 i=1 t=y+1 
4n8o 
t=y~+1 


where a,=,(+0), a2=&2(+0). The other constants may be explicitly deter- 
mined in the same manner by reference to Theorem XX and the forms of the 
determinants D. 
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This theorem is a generalization of the result announced by Birkhoff for 
the case = 2y*. 

We shall make no further applications of Theorem XX. In Theorem IV 
the sufficient conditions given are also necessary; by combining them with 
Theorem XX we find necessary conditions that 


lim foe. ; A) — G(x,y ; d))dady = 0 


uniformly, 0<x<1. It seems as though these conditions in their simplest 
form should involve only the constants a, @, 8, B, k, k, but we have been 
unable to ascertain the accuracy of this conjecture. 


V. THE ADJOINT SERIES FOR »=2u—1 


We shall next take up the expansion problem which comes to us from the 


integral 
1 1 
addy, 


where G(x, y; A) is the Green’s function associated with a regular differential 
system of order n=2u—1. If this differential system admits an adjoint, 
in which the parameter appears as —A instead of +A, the Green’s function 
for this adjoint system is precisely G(y, x; 4). Since the adjoint system is 
essentially a regular differential system of order »=2yu—1 of the type dis- 
cussed in §IV, the work of that section can be applied directly. Since the 
differential systems which we are discussing are too general to admit ad- 
joints except in restricted cases, we are faced with a new expansion problem. 
The series obtained we call the adjoint series of order » =2yu—1, for obvious 
reasons. Fortunately it is possible to throw the problem into a form which 
presents such similarities to that discussed in §IV that we do not need to 
do more than paraphrase the principal theorems demonstrated hitherto. 

In fact, if we make the substitutions z=1—x, 5=1—y, /(5)=f(1—5), 
we find at once 


frofco. ; daddy = f 5) GA 9,1 25 Nanay. 


Then, by interchanging x and y in the asymptotic forms of Theorem VII and 
subsequently making the substitutions described, we obtain asymptotic 


* Birkhoff, Rendiconti del Circolo Matematico di Palermo, vol. 36 (1913), pp. 125-126. 
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forms for np*-'G(1—¥4, 1—%; p") which are to all intents and purposes the 
same as those studied in the preceding section. The fact that in the deter- 
minants involved the terms in x and y occur in the last column and the first 
row respectively instead of vice versa is entirely unessential. Thus for the 
difference G(1—4, 1—Z;4) —G(1—5, 1—@;) where G(x, y;) and G(x«, 
are the Green’s functions for two regular differential systems of order 
n=2u—1 we can state theorems analogous to Theorems IX and XVII. 
For each Green’s function there is a theorem parallel to Theorem XII. 
Then we find the analogues of Theorems XIII and XVIII. Theorems XIX, 
XX, XXI and XXII likewise have images in the present case, not only by 
reasoning parallel to that of §IV, but also by the fact that the Green’s 
function can at this stage be regarded as arising from the typical differential 
system of Theorem XVIII, which admits an adjoint falling under §IV. 
When these theorems have been phrased in terms of the variables x and 9, 
it is easy to return to the variables x and y. 


Thus we have 


THEOREM XXIII. If f(x) is summable on (0, 1) 
1 
0 Cc 


uniformly, 0<aSx<b<1. 


THEOREM XXIV. If f(x) is summable on (0, 1) and if R,(y, x) is the 
residue of G(y, x;) atX=X,, then 


1 
lim | f(y)R.(y,x)dy = 0 
uniformly, 0<x <1. 


THEOREM XXV. On any closed interval (a,b) completely interior to 
(0, 1) the term-by-term difference of the adjoint series formed for any summable 
function f(x) in connection with two regular differential systems of order 
n=2un—1 converges uniformly to zero. We say that these adjoint series are 
equivalent on (a, b); in particular, they are equivalent to Fourier series on (a, b) 


The last remark results from the fact that the Fourier series obtained as 
special Birkhoff series in § III are not altered by an interchange of x and y. 
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THEOREM XXVI. The Green’s functions associated with two regular 
differential systems of order n=2n—1 for which , Gn=Qn, 
+, ki=hi, , Rn=kn, are such that for any summable function 


f(x) 
tim G(y,2 52) — Gy, 2; = 0 


uniformly, 0Sx <1. 


THEOREM XXVII. On the interval (0, 1) the term-by-term difference of 
the adjoint series formed for any summable function f(x) in connection with two 
regular differential systems of the form described in Theorem XXVI converges 
uniformly to zero. Such adjoint series are equivalent on (0,1). The adjoint 
series arising from the regular differential system 

u™ + ru = 0, n=2u—1, 
+ Biu(1)=0, n-12h 2 hy kita > hi, 


(0) + = 0 


may be regarded as typical, so far as the class of all summable functions is 
concerned. 


THEOREM XXVIII. If f(x) is summable on (0, 1) then 


lim 40) f { i~ > 


1 
Di(y, 2,0) )dedy ” 
0 


1 
D2(y,x,p) dpdy 0, 
1 


uniformly, 0Sx <1. 


We now obtain from the four determinants D,(y, 0, p), Di(y, 1, p), 
D:(y, 0, p), De(y, 1, p) respectively four new determinants D,°, D1’, D2°, D2’ 
analogous to the determinants D introduced in Theorem XX. In any deter- 
minant D the terms in the last column explicitly involving the variable 
p are replaced by zero to give the corresponding determinant 9. 
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THEOREM XXIX. [If f(x) is summable on (0, 1) the limits described in 
Theorem XXVIII are valid when x is set equal to 0 or 1 and the determinants 
D are replaced by the corresponding determinants D. For example, 


i=1 


THEOREM XXX. If f(x) is summable on (0, 1), then the convergence at 
x =0 or at x=1 of the formal adjoint series of order n=2y—1 for this function 
is independent of the nature of f(x) outside arbitrarily small neighborhoods of 
the points x =0 and x =1. 


THEOREM XXXI. If $(x) is summable on (0, 1) and if 
1 1 7 
are of bounded variation near x =0, then 


1 1 1 

lim ——— oy) f np""'G(y,k ; p")\dpdy = AizPi( + 0) + Biz + 

where the constants M and B depend only on the boundary conditions, i=1, 2, 

k=0, 1. In particular, 


Aiohi( + 0) + + 0)= + 0) 
n 


a; ay an 0 
i=n 
ki 
1 
i=y+1 


The other constants may be expressed similarly by reference to Theorem XXIX 
and the explicit forms of the determinants D. 


This completes our study of the adjoint series of order n =2y—1. 


VI. THE DERIVED SERIES OF BIRKHOFF, n=2u—1 


It is now convenient to restrict our attention to the regular differential 
system 
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Wi(u) = 0,---, W,(u) = 0, 


where f2,--- , fn are real or complex functions of the real variable z, 
continuous together with their derivatives of all orders on (0,1). The derived 
series of Birkhoff are then expressed as the integral 


Ax* 


f (k= 1,2, -- 9), 
while a method of summing these series is given by 


2ri 
If we wish to consider the series only for k=1, 2,--- , K, the functions 
pz, - + +, px do not have to be so heavily restricted; but the complications in 


statement thus introduced do not yield a proportionate increase in interest. 


Lemma XIII. Under the present hypotheses 


r4 k+l oF 
- fro (: d)dAdy 


k+l oF 


k+l 
f -~) G(x,y ;A)dd 
A} 


is continuous together with its partial derivatives of all orders with respect 
to x for all x and y on (0, 1) we have 


Now on the ranges OS OSx<1, and x<yS1, OSx51, G(x,y; dr) 
is continuous together with its partial derivatives of all orders with respect 
to x. Hence, on each of these ranges 


oF k+l k+l o* 
== { (1 Gdddy = f (1 ~) —Gdrdy. 
Ox* As Cc A? ox* 


Since 
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The lemma follows at once. 
Lemma XIV. The functions v,(y),--- , Un(y) defined by the identity 


u(y) u(y) nly) 


7ini(y) = 


have for a given sector S the asymptotic forms 


— l=m B E; 
vi(y)= » i(y) (y 
mp” 1 


(pw;)! gee 


i=1,---,m, where the functions B,(y),1=1, - - - , m, are continuous together 
with their derivatives of all orders on (0, 1), and the functions E are uniformly 
bounded for all y on (0, 1) and for all p of sufficiently great absolute value on S. 
The functions B,(y) are independent of the sector S.* 


We refer to Theorem III’, from which we obtain 
Exx(y »p) 


u(y) = (pws) + (pws) ——— 
p 
where P,(z) is a polynomial in 1/z, with coefficients which are functions of y 
and with the constant term 1. The denominator determinant in the defining 
expression for the functions 2(y), - - - , ¥n(y) is known to be independent of 
y. Hence we have 


0 0 1 0 0 


(0) cee (0) 


* Milne, Bulletin of the American Mathematical Society, vol. 23 (1916-17), pp. 166-169. 


P 
np" 
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where P is the quotient 


Peleus) + + 
putt m+1 
Ey n—l E, n—l 


On expanding the determinants and performing the indicated division by 
the usual processes employed in dealing with asymptotic forms we find 


+ y = 0 
Ei(y »P) 
1 


Since w,w;/w; is an mth root of —1 reducing to w; for 7=1, it is clear that 
p) =i:(y, For we have the development 


(pw)? p™ti 


di(y,p) 


where A;(y, p) is analytic at infinity in the p-plane, and the functions B,(y) 
are continuous together with their derivatives of all orders on (0, 1) for/=1, 


,m. Thus 

t=1,---,m, and 

Biy) 
= 1 + + 

( (pw;)? putt 


—<——<— 
__| 
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i=1,---,m, as we were to show. Since the functions B,(y) are expressible 
in terms of the coefficients Ax.(y) which occur in the asymptotic forms for 
us, - - -, wu," and which are independent of the sector S, they are independ- 


ent of the sector S. 


THEOREM XXXII. [Jf f(x) is summable on (0, 1); and if G(x, y; d) ts 
the Green’s function associated with the regular differential system of order 
n=2u4—1 of the type described above; and if 


i=u s=k 
= (pws) > Aax(x)Bg(y), 
i=1 s=0 a+f=k—s 
i=u+1 a+fp=k—s 
i=u-1 s=k 
i=1 s=0 a+f=k—s 
i=n s=k 
s=0 a+S=k—s 


where A ax(x)is the coefficient of (pw;)~* in the asymptotic form for us (x) on S, 
Ba(y) the coefficient of (pw;)~* in the form for v;(y), then 


An\ k+l 
lim fof (1 (no Fist dpdy = 0, 
Ox* 

4n\ k+l 
lim 70) (1 (nor - ; Fist) dpdy = 0, 
Ox* 


uniformly, 0<asx<b<1,120. The expression 


k+l 
ff 


is equivalent on any interval (a, b) completely interior to (0, 1) to a linear com- 
bination with coefficients Aq(x) of means of order k+l, 1=0, formed from the 
Fourier series and their derived series to order k for the functions f(x)Bo(x) 
=f(x), f(x)Bi(x),-- +, f(x)Bi(x). On any interval (a, b) the problem of 
the derived series of Birkhoff of order n=2u—1 is reduced to a problem in the 
theory of derived Fourier series. 


Case I. p on S;’. We take m=k in the asymptotic forms of Theorem 
II’ and Lemma XIV. We then compute the asymptotic form of 


1926} SERIES OF FOURIER AND BIRKHOFF 745 
o* oF 
no) ; p”) ; 
on S,’ by methods like those used in Theorem VII. We have 


1 = 1 i=n 
g(x,y) = ui(x)oi(y) ; — 
t=1 


W W (ttn) Wi(g) 


W,(u1) W (ttn) 


We multiply the first » columns in the numerator by 32:(y),--- , 30,(y) 
respectively, the next u—1 by —42,4:(y), - - - , 32n(y), and add to the last. 
We then take the first term of the new last column outside the determinant. 


The result of substituting the asymptotic forms for - , Mn, 
is 
a'G k mi(x,¥,p) 
i=n A (k) 
p [80] + 


where A,‘ is a determinant which differs from the determinant A, of Theorem 
VII only in having the elements of the first row changed to 


a1; = (pw ;)*e*i=[1], j 1, 
nj = (pw [1], j + 1, 
= O. 


Thus we find 


p 4n\ k+l mM; 


i=n Mm; pi” k i=n 


i=l i=u+1 


R4 


p 
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where the functions m are uniformly bounded for all x and y on (0, 1) and 
for all p on S,;’. Consequently, by Lemmas III, IV, V, 


k+l 


is uniformly bounded, 0<a<x<b<1, 0<y<1, for all p on S;’. Next, on 
integrating over (a, x) with respect to y just as we did in Theorem VIII, 
we find 


k+l a*G 


t=1 p t=u+1 
(k-1)+ (141) i=u i=n 
R 


The expression on the right approaches zero uniformly, 0<aSx<b<1, 
as R-« by Lemmas IV, V, VI. Thus 


4n\ k+l 
li ant Fi ; )\dpdy = 0 


uniformly, 0<a <x<b<i. 

With the aid of Theorem IV the present theorem is established in the 
case of the first integral. 

Case II. pon S,’. We compute the asymptotic form for 


Ox* 


in a manner analogous to that used in Case II of Theorm VII. We then 
follow our usual line of attack to validate the assertion concerning the 
second integral of this theorem. 

We notice that in the case of Fourier series, as given in the differential 
system of order m =2u—1 analysed in §III, 


— (pus) , 


t=1 


i=n 

Fi,= + DO (pai) 


t=1 


i=n 


Fas + 
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Thus 


FA= DFA t#=0,1, 1,2. 


e=0 a+p=k—s 


Since the functions A and B are independent of the sector S, the important 
relation between the derived series of Birkhoff and the derived series of 
Fourier, whose complete statement appears above, is now demonstrated. 

It is now proposed to investigate a few of the leading properties of the 
derived series of Fourier series; some of them can be extended to Birkhoff 
series. The results which we shall present have all been obtained by W. H. 
Young,* who used methods different from ours. 


THEOREM XXXIII. If (x) is a k-fold integral in the sense of Lebesgue 
on (0, 1), and if G(x, y; d)is the Green’s function associated with the differential 
system +hu =0, u(0) —u(1) =0, then 


1 1 Xr k+l 
tim (=~ f (:-~) G(x,y ; A)dddy 
0 Cy Aé 


1 oF 1 r* k+l 
— 1—-—) G(x,y;dady) =0, 


uniformly, O0<asx<b<1, forl1=0. In other words, the Cesaro sums of order 
k+l, 120, for the kth derived Fourier series of @(x) and for the Fourier series 
of ¢\”(x) are equivalent on (a, b). 


By Theorems IX and XXXII we need consider merely the differences 


z Xr k+l 
f f (1 — — drdy 
0 


4 


z 
1 
By integration by parts we can show 


f — dy = > ( — — 
0 


+ f 
0 


* W. H. Young, Proceedings of the London Mathematical Society, ser. 2, vol. 17 (1918), 
pp. 195-236. 
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i=k 


1 


i=1 
+ f 
1 


On substituting these expressions we find 


k k+l 


t=k AA 
+E 


(k—i)+ (+i) 
+4 > ( — 1)*-ig-)(1) ¢ 

The integrals over C vanish identically by Cauchy’s theorem. The integrals 
in the remaining terms approach zero uniformly, 0<a<x<b<1 as A—~, 
120, by the application of Lemma V. 

By Theorem VI, Corollary I, the type of mean employed in summing the 
two series may be replaced by a Cesaro sum of the same order, k+. 


Coroitiary I. The (k+1)th Cesaro sum of the kth derived Fourier series 
for $(x) converges almost everywhere to o”(x); and converges uniformly on 
(a, b) if p(x) is continuous. 


Corotiary II. We W(x) is bounded and summable on (0,1), 


k+l 
lim al 69) f. (1-5) 


for all x, and x interior to (0, 1). 


On applying the preceding theorem and its two corollaries to Theorem 
XXXII we find 


THEOREM XXXIV. Jf (x) is a k-fold integral in the sense of Lebesgue, 
Osx<1, then 


lim —- —- af oof. (1 ~) G(x,y ; = 


poo Ox* Jo 


almost everywhere, 0<x<1; and if $‘”(x) is continuous, the convergence to 
the limit is uniform,0<asx<b<1. 
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By Corollary I of Theorem XXXIII and by the last part of Theorem 
XXXII, we see that 


1 oF 1 r*4 k+l 
lim f (1 -~) G(x,y ; = 
c, 


+00 2ri Ax* 0 


almost everywhere, 0<x<1, the convergence being uniform on (a, 0) if 
(x) is continuous. By Corollary II of Theorem XX XIII we can compute 
the integral of ®,() by integrating under the limit sign on the left. We have 


— = f = Bia(x) = --- = 


Since o(x) =¢(x), it follows that ®,(«) =¢‘*-» (x) and the theorem is proved. 


. THEOREM XXXV. If f(x) is summable on (0, 1), then the convergence at a 
point x» interior to (0, 1), of ‘the (k+1)th Cesdro mean, 120, of the kth derived 
series of the Fourier series for f(x) is independent of the nature of the function 
outside an arbitrarily small neighborhood of the point xo.* 


We consider the differential system u’+du=0, u(0)—u(1) =0, starting 
from Theorem XXXII just as we did from Theorem XIX in proving Theorem 
XIV. The study of the (k+/)th mean of the kth derived Fourier series reduces 
to that of the expression 

1 +1 


z0 
— | (1 — — 
2a 0 -1 


1 1—zo +1 
+ re f f(z + xo) f (1 — — 
| 


where the integrals with respect to ¢ are taken over the real axis. We let 
A be an arbitrarily small fixed positive quantity, which we may suppose 
less than x» and 1 —2; we denote by / the function 

— 

which is summable on (—1, 1) for/=0. Then on integrating by parts k+1 
times we find 


+1 ( + 1)*+! +1 


-1 | 


* W. H. Young, Proceedings of the London Mathematical Society, ser. 2, vol. 17 
(1918) pp. 195-236. 
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an expression which is uniformly bounded for z2A,/20, and which ap- 
proaches zero with 1/A for each positive value of z, by the theorem of 
Riemann-Lebesgue. It follows that in the expression above the integrals 
over (A, xo) and (A, 1—<») have the limit zero as A>. Therefore the con- 
vergence of the (k+/)th mean, /=0, at x =x, depends on the behavior of the 
sum 


1 A +1 
0 -1 


1 +1 
f° fle + f (1 — — 
0 -1 


This establishes the theorem. 
An immediate consequence of this result is 


THEOREM XXXVI. If f(x) is summable on (0, 1), the behavior of the 


expression 
1 oF 1 k+l 
— 1-—) G(x,y;d)dddy, 120 
= ( (x,y; drdy, 120, 


at the point x =x,0<29<1, as vo is independent of the nature of f(x) out- 


side an arbitrarily small neighborhood of xo. 


We now proceed to two other theorems of more general character, with 
a view to showing the effectiveness of the method here considered in discus- 
sing the derived series of Fourier series. 


THEOREM XXXVII. If f(x) is summable on (0, 1); and if 
f(x) —f(2x0— x) 
2(x— xo) 
is summable on (0, 1) when x is interior to (0, 1) and f(x) is defined for all 
values of x by the identity f(x)=f(x+1); and if Ss, 24 denote the [A]th 
Cesaro sums of order | for a Fourier series and its first derived series respectively, 
then 


F(x) = 


— + = 0, mo, 120. 


In particular, if f(x) is continuous and limz.2/ (x) =A, then 


lim, x = Xo, l > 0.* 


* W. H. Young, Proceedings of the London Mathematical Society, ser. 2, vol. 17 
(1918) pp. 195-236. 
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It is sufficient to demonstrate that 
lima Y(f) — SSKY(F) + 45K (F)) = 0, x = xo. 


By taking k =1, /=0, in the final expression of Theorem XXXV and notic- 
ing that 


+1 
f (1 — $*)¢ cos A¢zdd = 0 


-1 


we find that we can replace 2,‘ (f) by the expression 


1 +1 
=f (f(z + x0) — f( —z+ x0))] (1 — sin Adzdgdz, 
TJ 0 -1 
in the sense that this expression has the same behavior as A. On inte- 
grating by parts we find 


1 1 +1 
(1 — ¢*)¢A? sin Adzdd = - (1 — 5@*)A cos Agzdo. 


-1 ZJ-1 
Hence we can replace 2,‘"(f) by 


f" f(z + x) — 2+ — 5¢*)A cos Adzd¢dz. 


2nJo Zz 


We note that F(—z+20)+F(z+20) =[f(2+2%0) —f(—2+%0)]/z. Thus we 
can replace S{)(F) and S{°(F) by 


f(z + x0) — f( — 2 + x) — cos Adzdgds 


2rJo ~1 
and 
1 74 +1 
f fis + f A cos Adzdodz 
2m. 0 Zz -1 


respectively. When we make the three replacements in the expression 


we find that it then reduces to zero; that is to say, for x =%o, 


(f) — 5S, (F) + (F)) = 0. 


The remainder of the theorem follows at once. 


| 
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THEOREM XXXVIII. The first derived series of the Fourier series for a 


function of bounded variation, x(x), is summable (C, 1), 1>0, to x'(x) almost 
everywhere on (0, 1).* 


We employ here the concept of integration with respect to a function of 


bounded variation. We let x be a point interior to (0, 1) for which the func- 
tion 


x(z + — x( — z+ 2%) 


X(z) = — 2x'(xo) 


is defined and satisfies the conditions 
1 
X(+0) =0, lim =f | dx| = 0. 
z 0 


Young has shown that the set of points thus defined is of measure 1. 


Taking G(x, y; X) as the Green’s function for the system u’+du=0, 
u(O)—u(1)=0, we find for x =x, 


(1 ~ ; d)dddy 
fof (1 _ 
0 v1 
+f f (1 ~) 


+ x(o) ( i- dddy 
0 


As 


ery 


In the third integral we have 


|x(1)| +] x(| + f | ax! 


IIA 


M, 


* W. H. Young, Proceedings of the London Mathematical Society, ser. 2, vol. 13 
(1914) pp. 13-28. 
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so that for />0 it approaches zero with 1/A in accord with Lemma V. By a 
similar argument we can dispose of the fourth integral. The two remaining 
integrals can be changed into the form 

z0 


1 +1 
x(—2+ x0) (1 — 64)" — 


2rJo -1 
1 +1 


x(z + xo) (1 — — 
0 -1 

as was done in Theorems XIV and XXXV. We take A positive but less 
than x» and 1—2». Then we show the integrals over (A, xo) and (A, 1—2o) 
negligible, in the following manner: we integrate by parts with respect to z 
in each of them and apply the inequalities of Lemma VII to the resulting 
expressions. The discussion has many points of similarity with the first 
part of that given in Theorem XIV. Thus we consider 


1 A +1 
-f (X(z) + zx’(a0)) | (1 — sin Agzdgdz, 1>0. 
T/0 


It is readily shown that 


1 +1 
lim -f ef (1 — sin Adzdodz = 1. 
0 -1 

To prove the present theorem, then, it remains to show that 


A +1 
lim X(z) (1 — $*)'pA? sin Agzdodz = 0. 
Awa 
This is easily accomplished after an integration by parts with the aid of the 
inequalities of Lemma VII; the method is that used by Hardy for the proof 
of the last step in Theorem XIV and by Young for the present theorem. 
This finishes the proof. 
We leave the reader to apply the last two theorems to the study of the 
derived series of Birkhoff of order »=2y—1. 


VII. THE SERIES FOR n=2y 
The regular differential system 


+ te + + .---+ = 0, 
W(u) = @,--- ? W,,(u) 0, 2u, 


where 


W(u) = 4+ Byu(1) 
kn, Rise > Ry, 


| 
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presents certain salient differences from the case n=24—1; but these are 
differences of form which do not necessitate any essential alteration in the 
spirit of the method used. 

We shall write \=p", considering instead of the whole A-plane two 
adjacent sectors S: R(pw:)SR(pwe)S - - - SR(pw,) in the p-plane. This 
region we shall call =. The sector S is a region T of §I forc=0. On employing 
the asymptotic forms for solutions m,--- , “, on a region T including S 
in its interior we ascertain the distribution of the characteristic values in S.* 
On one of the bounding rays of a sector S, R(pw,) = R(pwy+1) =0; we arrange 
the two adjacent sectors S forming = so that they have such a bounding ray 
in common. Then the characteristic values of p on © are distributed in two 
series, asymptotically near points equally spaced at distance 2 along each of 
two lines parallel to the bisecting ray of 2; these are simple characteristic 
values except in the case of a value common to the two series, which is 
always double. 

We now prepare the regions 2’, S’ in a manner entirely analogous to that 
employed in the case n=2u4—1. The arcsT’, y and the circles C can then be 
defined. The notations a, R will be carried over from our preceding work. 

On any region S we havet 


R(pwi) S R(pw2) S --- —B <0; 
R(pw,) = R(pw,.+1) = 0, along one bounding ray; 
= — 


R(pwn) = R(pwa-1) = --- = R(pwn-2) B > O. 


Hence we can consider the whole sector S’ at once; but we have two roots of 
—1, namely w,, w,4:, which have to receive special attention. This is the 
reverse of the situation in the case n =2u4—1, where the sector S’ had to be 
divided and where there was on each sector S;’, S,’ a single root of —1 
which required separate consideration. : 

In the present discussion we have to replace Lemmas IV, V, VI. We 
have 


LemMAIV’. If m(x, p) is a function of x and p bounded for0<asx<b<1 
and for all p on S’, then 


lim p*e*iz=mdp = 0 (i=1,---,u-—1), 
Roo Jy 


* Birkhoff, Rendiconti del Circolo Matematico di Palermo, vol. 36 (1913), p. 117. 
+ Birkhoff, these Transactions, vol. 9 (1908), p. 386. 
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lim = 0 
Roo Jy 

for any k, uniformly, 0<aSx<b<1. 


Lemma V’. If m(x, y, p) is a function of x, y, and p bounded for 0<asx 
<b<1,0<sy<1, and for all p on S’, then 


moms, 
pi” k+l 


for any k=0, (1) are uniformly bounded, 0<asxSb<1, OSyS1, for all 
y on S’ when 1=0, and (2) approach zero uniformly with 1/R for 0<asxSb 
<1, OSy<1, when 1>0. 


Lemma VI’. If m(x, y, p) is uniformly bounded, OSx<1, OSyS1, 
for all p on S’, then 


m 
lim f feo dpdy = 0 (i =1,---, 
Rw Ja p 
uniformly, and 
m 
lim f few —dpdy = 0 
a v3 p 


untformly, 


'The proofs are entirely analogous to those used for the preceding lemmas. 
Lemma III does not need to be replaced. 

The Green’s function for a system of order n =2y is given by the explicit 
formula written down in Theorem VII. We proceed to put this expression 
in a more useful form by multiplying the first » columns in the numerator 
by 32:(y), - - - , 30,(y) respectively, the next wp by —40,4:(y), - 
respectively, and adding to the last. We then take the first term in the new 
last column outside the determinant. The result of substituting the asymp- 


totic forms of 2, - - - , Un, %1, * is then* 
i=p i=n 
np" G = { [ws] epwi(z—y) lod} 


+ 
+ + [82] 


* Tamarkin, Rendiconti del Circolo Matematico di Palermo, vol. 34 (1912), p. 364. 


map, 
| 
(z 
1 
Me 
pw 
e 


756 M. H. STONE [October 


where A; is a determinant of order +1 in which the element a,;, 4=0, 1, 
m,j=1, 2, ---,n+1 may described as follows: 


and fork=1,2,---,n, 
ang = [anwf*],j =1,---, 15 = + ; 


i=n 


t=1 
On S’, 1/ {[0:}e%*»+[0,]e#-+[0.]} is bounded.* In short, we are prepared 
to proceed as we did in the case n=2y—1. 

We can thus establish Theorems IX’, XII’, XIII’, whose statements are 
identical with those of Theorems IX, XII, XIII, respectively, except for the 
change from »=2y4—1 to w=2y. In the proof of Theorem XII’, essentially 
the same as that of Theorem XII, we must remark a slight difference. If 
we let [c] be that part of ¢ on S’, then 


f 5 
0 


approaches zero uniformly, 0 <x <1, as the center of o recedes indefinitely 
from the origin. Hence on 2, 


ff f(y) np" "G(x, ; e")dpdy 


approaches zero in similar fashion. Because of the distribution of the charac- 
teristic values when ”=2y the circles ¢ remote from the origin may be inter- 
sected by the bisecting ray of 2, and it is necessary to introduce the 
corresponding modification of the proof. Theorem XIII’, which asserts 
the equivalence of Birkhoff series of order »=2y and Fourier series on any 
interval (a, 6) completely interior to (0, 1), is the strongest theorem possible, 
in general, as we see by reference to the sine, cosine, and Fourier series in 
the case n=2. 
We next introduce 


Lemma VIII’. On any region S’ 
(0: + [Bole + 
[Bi Je? + [Bojer + [82] 
if 05 = = 61, 0, = 


[1] 


* Birkhoff, Rendiconti del Circolo Matematico di Palermo, vol. 36 (1913), p. 120. 
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Lemma IX’. The integrals 


f f 9) dpdy 


where m(x, y, p) is bounded for all x and yy on (0, 1) and for all p on S’, converge 
uniformly to zero with 1/R, 0Sx51,0SaKl. 


We can now obtain Theorems XVII’ and XVIII’ which may be stated 
merely by replacing »=2u—1 by m=2y in Theorems XVII and XVIII. 
It is to be noticed that Theorem XVIII’ includes as special cases the leading 
results of Haar and of Walsh cited in the introduction. 

We next introduce the determinant D(x, y, p) obtained from As by re- 
placing each asymptotic expression [A] by its dominating constant term A, 
and deleting the terms involving e”+, e*«+1, From D(0, y, p) and D(1, y, p) 
we obtain new determinants D°(y, p), D'(y, p) by replacing by zero the 
terms of the first rows explicitly involving the variable p. We then obtain 
two theorems similar to Theorems XIX and XX. 


THEOREM XIX’. If f(x) 1s summable on (0, 1), then 


1 i=p i=n 
i=1 


1 
D(x,9,6) \dedy =0 
2 


uniformly, 31. 
THEOREM XX’. If f(x) is summable on (0,1) then 


1 i=n 1 
tim f 109) f (nor'G00,y D%y,0) = 0, 
0 


2 
1 1 


We can next use Lemmas X and XII much as we did in §IV. In fact, we 
have, with the notations of Lemma X and Theorem XXI, 


1 1 e 
= f 10) dy 
0 0 


737 

| 
at 
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where 6.—0,;= —2/n, for k=1,---,m; and 


1 1 e 
f f(y) f wpe = f f(y) dy 
0 0 


where 6.—6,;=2/n, for k=yu+1,---,n. Hence we obtain Theorem XXI’, 
whose statement parallels that of Theorem XXI. Likewise there results 


THEOREM XXII’. If $(x) is summable on (0, 1), and if 


ir if 
=~ d(x) =~ f adds 

x 

are of bounded variation near x =0, then 
1 1 
lin f np""'G(k,y ; p")dpdy = Ax + 0) + Be + 0) 
900 0 

where A,, By are constants depending only upon the boundary conditions, k =0, 1. 


For specific formulas concerning Ax, B;, we refer to a paper of Birkhoff.* 
In order to bring the discussion of the adjoint series of order n=2y 
under that already carried out, we employ the device of writing <=1—z, 


§=1-y, f(9)=f(1—y), whereby 


fro feo. ; \)dady - fan f ca — 


Then on S’ we find for mp"-"'G(1—4, 1—#; p") an asymptotic form which in 
its essentials resembles that for np"G(Z, 3; p"). We are then able to prove 
Theorems XXIII’, XXIV’, XXV’, XXVI’, XXVII’, differing from the cor- 
responding theorems of §V only in having »=2u—1 replaced by m=2u. 

The theorems on convergence at the end points x =0, x =1 present certain 
differences. We define D° and 9! from D(y, 0, p) and D(y, 1, p) by replacing 
by zero certain terms of the last columns, as in the, corresponding situation 
in §V. Then we have 


THEOREM XXVIII’. If f(x) :s summable on (0, 1), then 


fof (mp""'G(y, x; p") — { 5 


i=yp+1 
1 
#20) )dedy = 0 
2 


uniformly, <1. 


* Birkhoff, Rendiconti del Circolo Matematico di Palermo, vol. 36 (1913), pp. 125-26. 
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THEOREM XXIX’. If f(x) is summable on (0, 1), then the limit in Theorem 
XXVIII’ is valid if x is set equal to 0 or 1 and Dis replaced by the corresponding 
determinant 2; for example, 


1 = 1 
lim fof (np"'G(y,0 ; + — D(y,p))dpdy = 0. 


Theorem XXX’ is the exact analogue of Theorem XXX. 
THEOREM XXXI’. If $(x) ts summable on (0, 1), and if 


1 
xJ0 
are of bounded variation near x =0, then 
1 1 
oy) f np"""G(y,k ; p")dpdy = + 0) + Bi + 0) 
Y 


271 0 


where M, and B, are constants depending only upon the boundary conditions, 
k=0,1. For example, 


+ 0) + BoF2( + 0) = — + 0) 
a, 0 


k 
ay," — Bi Do 


where a;= —®;(+0)/w;, 7=1,2, +--+ - aud 
Lastly, we pass to the consideration of the term-by-term derived series, 


under the assumption that the coefficients f:,---, p, of the differential 
equation are continuous together with their derivatives of all orders. Our 
theorems deal with the method of summing the series given by the expression 


k+l 
(1 -~) G(x, ; 1205 


a 
i—y 
+—— 
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Lemmas XIII and XIV hold for »=2y as well as for m=2u—1. If we write 


FP(x,y,p) = DX Aan (x) Bs(y), 


s=0 


F(x,y,p)= + Dd Bay), 


e=0 


we find 


oF f 
np"— = 4F,° — > — + 
(k) 
As 


+ [Boje + [62] 


where A® is a determinant differing from As only in having the elements 
of the first row given by 
ao; = (pw [1] G = 1, ; 
do; = (pu) [1] G=u+i,---,m); 
@o.n+i = O, 
for p on S’. The demonstration of Theorem XXXII’ then follows the lines 
of that of Theorem XXXII. 


THEOREM XXXII’. If f(x) is summable on (0,1)and if G (x,y; ) is the 
Green’s function associated with the regular differential system of order n=2y of 
the type described above; and if F,°(x,y,p), Fi\(x,y,p) are defined by the iden- 
tities above, then 


uniformly,0 =0. The expression 


k+l 


is therefore equivalent on any interval (a,b) completely interior to (0,1) to a 
linear combination with coefficients Aax(x) of means of order kk +1,12= 0, 
Jormed from the Fourier series and their derived series up to order k for the 
functions f(x)Bo(x) = f(x), f(x) Bi(x), - - - , f(x) B(x). On any interval (a,b) 
the problem of derwed Burkhoff serves of order n = 2p 1s reduced to a problem 
in Fourier series and their derived series. 
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Theorems XXXIV’ and XXXVI’, analogous to the correspondingly num- 
bered theorems of §VI, can then be obtained without difficulty. 

From our brief indications the reader will perceive that the case m = 2y 
is in its main features similar to the case m = 24 — 1, which we have dis- 
cussed in greater detail. 


UNIVERSITY, 
New York, N.Y. 


ANALYTIC APPROXIMATIONS TO TOPOLOGICAL 
TRANSFORMATIONS* 


BY 
PHILIP FRANKLIN anp NORBERT WIENER 


I. INTRODUCTION 


A continuous transformation of a two-dimensional region into itself, or 
into a second such region may be characterized by a pair of continuous 
functions 

X = X(x, ¥), Y= V(x, y). 
The continuity of these functions, and the existence of a unique pair of 
inverse functions are the only restrictions imposed by the topologist. On 
the other hand, the differential geometer usually demands of the trans- 
formations he considers, that the functions defining them be analytic. It 
is a matter of some interest to investigate the relations connecting these 
two types of transformation, and in particular to determine whether the 
general transformations of the first type may not be regarded as in some 
sense limiting cases of the more special analytic type. This question is 
here discussed, and we shall prove that every continuous one-to-one trans- 


formation of a two-dimensional region of finite connectivity may be approxt- 
mated to an arbitrary degree of exactness by an analytic one-to-one transform- 
ation. 

The analogy between the present investigation and the Weierstrass 
approximation theorem is obvious. The peculiar difficulty of the problem 
here treated is the need of keeping our approximating transformation 
one-to-one. This requires a discussion of a combinatorial nature. 


II. CONTINUOUS TRANSFORMATIONS 
Let the transformation 
X = X(x, y), Y = ¥(x, ») 
transform the closed region r of the x, y plane into the closed region R of 


the X, Y plane. Let X(x, y) and Y(x, y) be continuous and single-valued, 
and let it be possible to write 


x= Y), y = Y), 
* Presented to the Society, May 2, 1925; received by the editors in May, 1925, and (revised) 


in March, 1926. 
t This theorem was suggested to the authors by Professor J. W. Alexander. 
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where these functions are likewise single-valued. We shall then call our 
transformation of r into R biunivocal (or one-to-one) and continuous. We 
shall show that the inverse of this transformation is likewise continuous. 

The continuity of this inverse transformation from R to r will follow if 
we show that any sequence of points P;, Ps, - - - approaching a limit P is 
transformed into a sequence fi, ps, - - - approaching a limit p. We form the 
set p;, the transform of the set P;, and notice that, since it is an infinite set 
in the closed region 7, it must contain at least one subsequence with a limit 
point, say ~. That is, 


lim pi, = p. 


From the continuity and biunivocal character of the direct transformation, 
it follows that 

lim P;, = lim P, = P. 

n=O 
Since this argument proves that any subset of the p; approaching a limit 
must approach p, we conclude that the set p; approaches # as a limit. 

In order to discuss approximations to transformations, we shall needfa 
measure of the distance between two transformations. If S and T are two 
transformations, with inverses S~! and T~-' respectively, then we define 
the distance between them as the greater of the two numbers 

maximum [S(P)—7(P)] and maximum [S-'(P)—T7-1(P)], 
where [A—B] denotes the distance from A to B, and the maximum is 
taken as P ranges over the entire region for which the transformations are 
defined. To approximate to within a given distance of a transformation T, 
it is sufficient to approximate for the direct transformation alone. This 
follows from the fact that if S, are a series of transformations such that 


lim maximum [S.(P) — T(P)] = 0, 


lim maximum [S.(P) — T-"(P)] = 0. 


For, we have 
— T-(P)] = [TTS>(P) — 


and hence, since S, is approaching T, and T-' is continuous, our contention 
follows. 


i} 
then 
fl 
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As we shall wish our approximating transformations to extend over the 
entire regions of definition of the transformation and its inverse, r and R 
respectively, if we kept to these regions we should have to take the boundary 
of r into that of R. In general this can not be done by any analytic trans- 
formation. We avoid this difficulty by seeking a transformation which 
shall exist in some region including r, whose inverse exists in some region 
including R, and which approximates the given transformation in the sense 
of having its distance from it, as defined above, arbitrarily small, say less 
than e. 

We restrict ourselves to the case where the region 7, and hence R, is of 
finite connectivity, bounded by a number of simple, closed Jordan curves. 
We embed ¢ and R in two large squares, s and S respectively, so chosen that 
no point of r is within a distance ¢ of s, and no point of R is within a distance 
eof S. We then map the region S — R on the region s —r by a continuous trans- 
formation, which agrees with the original transformation on the boundary of 
R and r, and takes the vertices of the first square into those of the second. 
This is seen to be possible by linear interpolation in the case where r and 
R are bounded by circles, and the general case reduces to this in virtue of 
the Jordan-Schoenflies theorem. 

If, now, we obtain an analytic biunivocal transformation 7; which 


exists in the square s and whose inverse exists in a square inside of S, con- 
centric with it and at distance ¢ from it such that the distance between the 
transformations 7; and 7;, the extended continuous transformation, is 
less than ¢, we shall have solved our problem for the original regions r and R. 


III. THE POLYGONAL NETWORK 


In approximating the extended transformation, 7,, which takes the 
interior, sides and vertices of a square s into those of a square S, we shall 
find a simultaneous subdivision of s and S of great service. In this section 
we shall prove 


Lemma 1. Given a continuous, biunivocal transformation T, which takes 
the interior, sides, and vertices of a square s into those of a square S, we may 
subdivide s and S into corresponding convex polygons, of which the number 
meeting at a vertex is one, two, or three according as the vertex is a vertex of the 
square, a point on a side of the square, or an interior point of the square, res pec- 
tively, and having ihe property that any continuous biunivocal transformation 
T; which maps each polygon of s on the corresponding polygon of S is at distance 
less than €/2 from 7. 
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Let H be selected less than 10¢e, and »(H) be chosen less than H, and 
so that 


[Pi — P2] < ZH if [p: — <9. 


Let us place on s a square network with mesh n(#), and sides parallel to the 
sides of s, and find its transform under 7;. Let Ni, Ne, - - - be the intersection 
points of this transformed network in S, including the points where the 
network cuts the boundary. Let the least distance between any such point 
and a point on a side of the net not abutting on it be 3H’. Surround each 
of the points Ni, Ne, - - - in S with a circle with radius H’. No two such 
circles can intersect. Furthermore, no such circle about NW; can intersect a 
side of a compartment of the net which does not terminate in N;. 

Consider now the side of the transformed net going from N; to Nj. 
Of all the points in which this side intersects the circle about N;, taken in 
their natural order on the side, there must be a last point P;. Similarly, of 
all the points in which the side intersects the circle about N;, taken in their 
natural order on the side, there must be a first point P,;. Form such pairs 
of points on all the sides of the network. Then there must be a minimum 
distance, 25, from the points on the segment of the network P; P; and the 
points on any other such segment, or the points on the H’ circles about ver- 
tices other than N; and N;. Take Q:, Qs, - - - , Qs any sequence of points on 
this segment P,P, such that the distances P; Q:, Qi Qe, - - - , Qn Pe are all 
less than 6. Replace the side NV; N; by the broken line VW; P: Q; - - - Qn P2 Nj, 
and similarly replace every other side of a mesh of the net by such a broken 
line. Then each mesh of the net will go into a simply connected polygonal 
region, and no point of this region will lie outside of the original mesh, and 
abutting meshes. Since the distance from a point in s to a point in an abut- 
ting mesh is at most 2./2n <3n, if T; is any continuous biunivocal trans- 
formation taking each mesh of the net on s into the corresponding poly- 
gonal mesh of the modified network just described, we shall have 

maximum [71(p) — 72(p)]<3H. 
Since the inverse transformation Ts! takes a point in S into the same or an 
abutting mesh into which 77" takes this point, we have 
maximum < 3n < 3H. 

We must now go from the polygonal network to convex polygons of 
the kind described in the lemma. We use 7;~' to transform the vertices of 
the polygonal network back to s. This gives a series of points m ~: m1 - 
gn p2 n; on the segment ”, n; dividing it into consecutive segments which we 
may put into correspondence with the segments of the broken line NV; P: Q: 
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- + + QO, P2 N;. We thus have a correspondence set up between the periphery 
of a rectangle in s and of a possibly reentrant polygon in S, for each mesh. 
We wish to show that it is possible to divide the two figures by straight lines 
into a finite number of similarly placed triangles. 

Consider the polygon. We shall say that a point on its boundary is 
visible from another point situated anywhere in its plane, if the two can be 
joined by a straight line which does not cut the boundary. For any vertex 
of the polygon, we may find a point inside the polygon, sufficiently close to 
it so that this vertex and the two adjacent ones are visible from it. If the 
polygon is not a triangle, we may shift this point so that from it some fourth 
vertex, as well as as the three consecutive ones we began with, are visible. 

Let us now consider one of the rectangles, and its corresponding, possibly 
reentrant, polygon. Select a vertex of the rectangle, and the corresponding 
vertex of the polygon. Finda point inside the polygonfrom which this vertex, 
the two adjacent ones, and a fourth are visible, and join it to these four by 
straight lines. Select a point inside the rectangle, so near the chosen vertex 
that, when the corresponding straight lines are drawn, all the new polygons 
will be convex. Now proceed similarly with one of the vertices of the convex 
polygons which is not collinear with its adjacent vertices. At each stage 
we reduce the number of sides of the polygon dealt with, and keep the new 
polygons convex, so as to allow the process to be repeated. After a finite 
number of repetitions, no polygon will have more than three sides, and we 
shall have succeeded in dividing our corresponding meshes into similarly 
placed triangles. 

The sides of these triangles in the two networks will have a lower bound 
o. Surround each node of each network with a circle of radius ¢/3. | Draw 
the inscribed polygon meeting the circle in the points where this circle cuts 
the lines of the network. These polygons will be convex, as will be polygons 
formed from the triangles by removing the portions inside these polygons. 
Further, the network of convex polygons will have the property that pre- 
cisely three of them meet at each vertex inside the square, two at each vertex 
on a side of the square, and just one abuts on each vertex of the square. 

Suppose, now, that 7, is a continuous, biunivocal transformation which 
takes each convex polygon in s into its corresponding convex polygon in S. 
Points in S arising from a given point in s under 7; and 7; respectively come 
from the same mesh or adjacent meshes under 7». Consequently, under 7; 
these points come from meshes at worst next but one, or points in s at dis- 
tance less that 3\/2 <5n, and we shall have 


maximum [7;(~) —72(p)] < 5H < €/2. 
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Similarly, the inverse transformations, Ty! and Ty", take a point in S into 
two points of s at worst in next but one meshes, and we have 


maximum [771(P) —Tz1(P)]<5n<5H <é/2. 


Thus the network of convex polygons which we have found satisfies all the 
conditions of Lemma 1, since the distance between T; and 7; is less than €/2. 


IV. DIFFERENTIAL TRANSFORMATIONS 


We shall now show that a transformation 7, of the type mentioned in 
Lemma 1 may be found which is defined by functions having partial deriva- 
tives with certain continuity properties. That is, we shall prove 


LeMMA 2. Given two squares, s and S, subdivided inio corresponding 
convex polygons, of which the number meeting at a vertex 1s one, two, or three 
according as the vertex is a vertex of the square, a point on a side of the square, 
or an interior point of the square, respectively; there exists a continuous bi- 
univocal transformation T,: X = X2(x,y), Y = Yo(x,y), for which X, and V2 
possess first partial derivatives with respect to x and y continuous in s, and 
for which the jacobian of X2, Y2 with respect to x and yy never vanishes in s, 
which maps the interior and boundary points of each convex polygon in s on 
those of the corresponding polygon in S. 


In setting up the differentiable transformation 72, we shall frequently 
have occasion to change our system of codrdinates. To carry over the proper- 
ties of the transformation from one of these systems to another, we prove 
a general result once for all. 


If a point transformation of one region on another is given parametrically, 
the parametric curves U=U(X, Y), V=V(X, Y), u=u(x, y), v=v(x, y) 
being such that (1) the (U, V)-(X, Y) relation is one-to-one, so that it may be 
solved in the form X=X(U, V), Y=Y(U, V), and similarly the (u, v)-(x, y) 
relation may be solved in the form x=x(u, v), y=y(u, v); (2) the functions 
U(X, Y) and V(X, Y) possess continuous first partial derivatives with respect 
to X and Y, and similarly the functions u(x, y) and v(x, y) possess continuous 
first partial derivatives with respect to x and y; (3) the jacobian of U, V with 
respect to X, Y never vanishes, and similarly that of u,v with respect to x, y 
never vanishes; then a one-to-one relation between U, V and u,v: U=U(u, 0), 
V=V(u, v), for which U and V possess continuous first partial derivatives with 
respect to u and v, and for which the jacobian of U, V with respect to u, v never 
vanishes, implies a one-to-one relation between X, Y and x, y: X =X(x, y), 
Y=Y (x, y), with similar properties at interior points of the regions, and con- 
versely. 
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For, by hypothesis (1), X and Y are single-valued functions of U and 
V. Also, by hypotheses (2) and (3), these functions possess continuous 
first partial derivatives with respect to U and V.* But, we are also assuming 
a differentiable one-to-one relation of U, V to u, v, and by hypotheses 
(1) and (2), there is a differentiable one-to-one relation between u, v and 
x,y. Since a differentiable, uniform function of a pair of similar functions 
yields a similar function, we see that X and Y are uniform, differentiable 
functions of x and y. The property of the jacobian follows from the fact 
that 

OX ,Y) a(X,V) aU ,V) a(u,r) 


(x,y) U,V) u,v) 


and 


a(x a(U,V) 
a(U ,V) a(X,Y¥) 


The second relation shows that the first factor above can not vanish, being 
the reciprocal of a finite quantity, and the second and third factors do not 
vanish by our hypothesis. 

The converse is proved in a similar manner. 

The principal types of coérdinate systems we shall use are three. Those 
of the first type will be rectangular cartesian coérdinates, with various orien- 
tations, and choice of origin. These clearly satisfy our three hypotheses, 
with relation to a fundamental coérdinate system, say that with origin at 
one corner of the square s or S, and axes along two of its sides. 

The second type are polar coérdinates, with the pole at an arbitrary 
place. It is evident that these satisfy all our conditions, for any region not 
including the pole, when the angular coérdinate ¢ is reduced modulo 27, 
and all the functions involving this codrdinate are assumed to be of period 
2m with respect to it. 

The third type are modified polar codrdinates. We begin with a closed 
curve which surrounds the pole, or origin, with a continuously turning tangent 
which never coincides with a radius vector. We use a series of curves, similar 
to this and similarly placed with reference to the pole, as our u-curves, taking 
the scale such that the length cut off by these curves on any fixed radius 
vector increases uniformly with u. The » curves are our radii vectores. 
Analytically, if r =f(t) is the equation of the closed curve in polar coérdinates, 
our parametric curves are related to polar codrdinates by the equations 


u=at+brf(t), v=t. 


* Cf. Osgood, Funktionentheorie, Berlin, 1920, p. 69. 
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We may now show that, in any region not including the pole, these parametric 
curves satisfy our three hypotheses. In such a region, a pair of values x, y 
yields a single pair of values r, ¢ and hence a single pair of values u,v. Con- 
versely, a pair “, v fixes ¢ by the second relation and then r by the first and 
thence x, y. Thus the first hypothesis is satisfied. The partial derivatives 
with respect to r and # are 0u/dr=Df(t), du/dt=br f' (t), dv/dr =0, dv/dt=1. 
These are all continuous, since our assumption about the tangent to the 
closed curve involves the existence of f’(#), finite and continuous. From 
the relation of polar to cartesian codrdinates, we see that, in a region 
excluding the pole, the second hypothesis is satisfied. For the third hypo- 
thesis, we have 


(u,v) A(r,t) f(t) 
(x,y)  a(r,t) A(x,y) 


which does not vanish since f(#) does not vanish. 

We next proceed to the explicit construction of the differentiable trans- 
formation 7;. Our method is first to set up affine transformations applicable 
to suitable neighborhoods of the vertices of the polygonal network, then to 
interpolate between these along the sides of the polygons, and finally to 
interpolate into their interiors. 

Since only one polygon abuts on the vertices of the squares s and S, at 
these points we may use the identity, or a similarity transformation. We use 
the latter: 


X=kx, Y= ky, 


taking & positive and so small that the adjacent vertices of the network in 
s are mapped by this transformation on the adjacent sides of the network in 
S. At vertices of the network on a side of the square, we have to transform 
this side, and a line meeting it, into the corresponding side and a line meeting 
it. Taking the vertices in question as our origin, and the sides of the squares 
as the y axes, so oriented that, with the standard relation of the axes, the 
positive x axes point inside the squares, we have to take the lines x=0, 
y=mzx into X =0, Y= MX respectively, where m and M may be positive or 
negative, by a transformation with positive jacobian. We may use 


X=kx, Y=ky+k(M—~m)x, 


where we again take k positive and so small that the three adjacent vertices 
of the network in s are each mapped on the interior of the corresponding 
adjacent side in S. At the interior vertices of the network, we have to trans- 
form three lines forming three convex sectors into three similar lines. Taking 
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the vertices in question as our origin, and one of the lines as the positive 
x axis, we have to transform the lines y=0, y=mx, y= —nx into Y=0, 
Y=MX, Y=—NX respectively, where m, n, M and N are positive, by a 
transformation with positive jacobian. We may use 


X = kmn(M + N)x + k(mN — nM)y, 
Y = kMN(m+n)y, 


where we take , as before, positive and so small that the three adjacent 
vertices of the network in s are each mapped on the interior of the corres- 
ponding adjacent side in S. 

We nowhave affine transformations defined at each vertex of our network. 
We surround each vertex in s by a small triangle, with sides perpendicular 
to the sides of the network meeting there, and define 7: inside and on the 
boundary of this triangle as the affine transformation just referred to. This 
gives a corresponding set of triangles in S. We must now interpolate between 
these transformations along the sides of the polygon. Let p; p. be a side in 
s, and P, P; be the corresponding side in S. If we take /, as origin, and f; ps 
as the positive x axis, and P, as origin and P; P, as the positive X axis, 
the affine transformation at /; will have as its equations 


i= a,x + biy, Y = C1y;, 


where 4; is positive from our choice of direction along the x axis, and ¢ is 
positive since the jacobian remains positive for all axes with the standard 
orientation. In terms of the same coérdinates, the affine transformation at 
pz will be 


A= d2x + boy + do, Y = 


Here a2 and ¢ are positive as before. We see that d,.=P; P2—4a2(p; pe) is 
positive, since our condition on adjacent vertices makes the image of p; 
under this transformation go into a point between P; and Pz, i.e. (ds, 0), 
the image of (0,0), is a point with positive abscissa. We also assume that 
a; =, which is permissible, since, if it were not the case, we could reverse 
the réles of p; and p2 which would interchange these quantities. 

If x=x, and x =x are the equations of the sides of the small triangles 
which cut /; f2, our problem is to find a differentiable, biunivocal transfor- 
mation which agrees up to the first partial derivatives with the transformation 
at p: for x =x, and with that at p2 for x=x:. We write 


| 


TOPOLOGICAL TRANSFORMATIONS 


(z— +4 (a1 + — 


“(ay — 2)? 


+ a 


(x — + (x1 + x2)(x — 


(x2 — X 


+ bay (x a)? — 


 m—m 

— *)* f, ~ 


+ dz 


Y=ay 


+ 


By direct calculation, we find a for x=,, these equations give 
X=amnt+hy, a@X/dx=a,, dX/dy = hi, 
Y = ay, OY/dx=0, <dY/dy = cy. 

For x they give 
X = aox2 + boy + de, OX/dx = a2, AX/dy = be, 
Y = coy, oY/dx=0, <dY/dy = cs. 


Thus this transformation joins to those at ~, and f. as desired. We also note 
that, for y=0, we have 


aX/dx = { (x2 — 21)? — 3(x1 + x2)(x — x1) } 


— #1) 


6(x — x1)(x%2 — x) 


(22 — 


+ dz 


For 0<%,<x <%2, the coefficients of a2 and d2 are positive. Since the sum of 
the coefficients of a, and a is unity, and we have arranged matters so that 
a,2a,>0, and d,>0, we see that the sum of the three terms is positive. 
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Hence, along the line y=0, X is a monotonic function of x, and the mapping 
is one-to-one. Again, for y=0, we have 
= 0, 
(x — x2)? — x) 


2X1 


aV/ay =c {1 == 


(x2 — 21)? 


— %2)? — 


For 0<x;<%x <2, the coefficients of c, and c, are positive, and since c, and 
G, are positive, OY /dy is positive. Hence the jacobian of X, Y with respect 
to x, y for the cubic transformation under discussion is positive along the 
line y=0. Thus the transformation is one-to-one im kleinen, and, since it is 
one-to-one along the line y=0, and continuous, we may find a strip including 
this line in which it is one-to-one im grossen. Furthermore, since the trans- 
formation has continuous derivatives, if this strip is taken sufficiently 
small, the jacobian of the transformation will be positive throughout the 
strip. 

We have now to interpolate into the interior of our polygons. In each 
polygon of the figure s, we draw a differentiable closed curve, close to the 
boundary of the polygon. We may form it of straight lines parallel to the 
sides between the x; and x2 used above in the strips, and circular arcs in the 
triangular sectors. If it is taken sufficiently close to the polygon, it will be 
visible from some point c inside the polygon, i.e. such that each radius 
vector drawn from ¢ to the curve cuts it in a single point, at an angle not 
zero. Thus it may be used as the closed curve of a system of modified polar 
coérdinates “, v of the kind previously described, with the pole at c. Consider 
the transforms of these u curves in S, under the cubic and affine transforma- 
tions. In a sufficiently narrow strip, these curves will be visible from some 
point C inside the polygon. In this strip we shall take as parametric curves 
the curves U =const., the transforms of u=const., and the curves T =const., 
the radii vectores drawn from C, T being the angle made with a fixed vector. 
To show that these are admissible parameters, we note that the one-to-one 
relation of U, T to X, Y follows from the fact that the U curves are visible 
from C, so that a pair of values U, T gives a single point X, Y. On the other 
hand, a pair X, Y clearly gives one value of T, and a single U resulting from 
the u obtained from x, y. The U and T curves clearly are differentiable. For 
the condition on the jacobian, we have 


a(U,T) _a(U,T) aU ,V) 
AX,Y) U,V) 


where the curves V =const. are the transforms of »=const. under the affine 
and cubic transformations. From the definition of the U, V curves, the 


i 
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jacobian of U, V with respect to x, y is the same as that of u, v with respect 
to x, y and hence can not vanish, since u, v were admissible parameters. 
The last jacobian does not vanish, since it relates to the cubic and affine 
transformations. For the first factor we have 


_ ar 
aU,V) av 


If this vanished, we should either have 07 /0U =0 at the same time, in which 
case since U and V are admissible parameters, we would be at a singular 
point on the curve (or in its representation), or we should have 67 /dU <0, 
and the T and U curves would be tangent. The first case is inadmissible, 
since the angular coérdinate represents the radii without singularities in any 
admissible system of coérdinates, and the second is excluded since the U 
curves we are using are all visible from C. Thus the U, T parameters satisfy 
our three conditions, and form an admissible system. 

We choose the scales so that the curves u=0 and u=1 are respectively 
the inner and outer boundaries of a strip entirely inside the region where our 
affine and cubic transformations are valid, while their transforms U =0 and 
U =1 are the boundaries of a strip inside which the U, T codrdinates are 
admissible. In these coérdinates, the affine and cubic transformations will 
have the equations 


u=U, t=F(T,U), 

where we replace v by #, its equal. We note that 
OF O(u,t) 
 U,T) 


In the strip between «=1 and the polygon, we define 7; as the affine and 
cubic transformation, which is that written above near u=1. In the strip 
from u=1 to u=0, we use a derived transformation, such that it agrees 
with this transformation up to the first derivatives at w=1, and takes the 
curves ¢=const. into curves tangent to the curves T=const. at u=0. The 
derived transformation is 


w= U, ¢t=F(T,2U? — =F,(T,U). 


This is one-to-one for values of u, or U, between 0 and 1. For, since u=U, 
t=F(T, U) is biunivocal for these values, for any value of u or U in this 
range, =F (T, U) gives a one-to-one relation of to T. But 2U?—U* takes 
on the values between 0 and 1 when U does, and accordingly for fixed 
uor U,t=F,(T, U) gives a one-to-one relation of ¢ to T. The tangency prop- 
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erty required follows from the fact that (@F,/8U)v.o=0, while (@F:/8T)v-o 
=(0F/8T)v-o>0. For u=U=1, we easily see that the derived trans- 
formation and its derivatives agree with the earlier transformation. Its 
jacobian is 0F,/dT =0F /dT >0. 

In continuing our interpolation, it will be convenient to use polar coérdi- 
nates. As our lemma on systems of coérdinates only applies to interior 
points, we must extend our transformation beyond the inner boundary, 
u=U=0, to make this curve an interior curve. We do this by putting 


u=U, t=F,(T,0), 


in a narrow strip inside the curves w= U=0. This joins with our preceding 
transformation along these curves, continuity of the derivatives being re- 
tained, since (@F,/8U)v.o=0. Its jacobian is (@F,/8T)v.o>0. 

We next draw two circles of radius a about the points ¢ and C as centers 
respectively, taking a so small that these circles lie entirely inside the curves 
u=0 and U=0. To extend our transformation so that it maps the region 
between «u=0 and the first circle on to that between U =O and the second, 
we introduce polar coédrdinates ¢, r and T, R. Let the equations of u=0 and 
U =0 in these coérdinates be 


r=w(t)=27T), R=V(T), 
where, as in the following equations, all the functions of ¢ and T are of 
period 27. Along these curves we have 
at/aR=0, at/dT = 9r/AR= E(T), =G(T). 
Since dr/dT =(0r/0R) (dR/dT)+0r/dT, we have v'(T)=E(T)V'(T)+ 
G(T). We put 
t=F,(T,0), r=G(R,T), 

where G(R, T) is periodic in T, has 6G/8R>0, and satisfies the further con- 
ditions 

G(a,T) =a, = 1, 

GV,T) =», (0G(R,T)/dR)z-v = E(T). 


The determination of G(R, T) for a fixed value of T reduces to the prob- 
lem of finding a function of R, with positive derivative, which, with its 
derivative takes two positive assigned values at each end of an interval. 
The function £(T) is positive, since 


d(r,t) OF ; OF ; 
= E(T) (=) > 0, and (=) > 0. 
0(R,T)/v-0 0T/v-0 
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We define the auxiliary function H(R) as follows: 
A(R) = 1+ (R—a)(M — 1)/k, asRsartk, 
H(R) =M, atkSRSV—k, 
A(R) = V-kRSERSV. 
Here we take 
) 


and k a positive number so small that 
2(v—a) 
1+E 
This makes H(R) a continuous, broken-line function, with positive ordinate 


throughout, and such that the area under it isv—a. Also, H(a@)=1 and H(V) 
=E. Consequently if we put 


k<i(V—a), k< 


R 
G(R,T) = a+ f H(R)dR, 


it will satisfy all the conditions required of it above. We may take k an 
absolute constant, since the right members of the inequalities it satisfies 
are positive continuous periodic functions of T, and hence have positive 
minima. 

We note that the transformation given above maps our ring shaped 
regions on one another in a one-to-one manner, since the relation ¢= 
F,(T,0) is clearly one-to-one, and if either ¢ or T is fixed, the second relation 
r=G(R, T) is one-to-one, since G/9R>0. On the outer boundary it agrees 
with our former transformation, since 07/9R = E(T) there, and t= 
F(T, 0), r=G(V, T)=0(T). From these we deduce 


dr/dT = (dr/AR)(dR/dT) + dr/dT, or v'(T) = E(T)V'(T) + 


which, combined with the earlier relation 0'(T) = E(T)V’(T)+G(T), shows 
that dr/8T =G(T). The transformation has a positive jacobian, since this 
equals 
In the ring between R=a and R=a/2, corresponding to r=a, r=a/2, 
we put 
r=R, 
t = 47(R — a)*(4R — a)/a® + F,i(T,0)(2R — a)*(Sa — 4R)/a*. 


For R=a, this gives us r=a, 0r/9R=1, =0, t=F,(T, 0), = 
(0F;/0T)v-o, 0t/9R=0. These values agree with those of the last trans- 
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formation at this circle, all except the value of d7/0R following directly, and 
that from the relation 


dr/dT = (dr/8R)(dR/dT) + ar/AT, or 0 = Or/AT. 


For R=a/2, the equations give r=a/2, dr/9R=1, dr/8T =0, t=T, 0t/dT =1, 
01/8R=0. Thus it joins on to the identity along this circle. The transfor- 
mation is one-to-one, since, for fixed r or R, the t-to-T relation is one-to-one. 
For, we have 


4(R — a)*(4R — a) (2R — a)*(S5a — 4R) 
oT a + (= U=0 a’ 


which is clearly positive for a/2<R<a. As this is the value of the jacobian, 
we see that our transformation has a positive jacobian. 


Inside the circles r=a/2, R=a/2, we use the identity transformation 
r,=R,t=T, 


to complete the mapping. As the identity has the same form in all coérdi- 
nates, the singularity of the coérdinates at the pole is of no importance. 
There is now a transformation for each polygon, given in polar coérdinates 
with pole in the polygon, and defined in a region including the polygon in its 
interior. Thus, by our lemma on systems of coérdinates, it may be expressed 
in terms of the particular set of cartesian coérdinates used for T:. The totality 
of these yields a differentiable transformation 7; of the kind required. 


V. ANALYTIC TRANSFORMATIONS 


Weierstrass* has given an example of an analytic function approximating 
to any degree of accuracy a given continuous function of two variables, 
f(x, y), in a finite region R. This approximating function is 


1 
S(x,y) = —- f f 
Jr 


when is sufficiently large. It is easy to show that if f(x, y) possesses con- 
tinuous first partial derivatives over any closed region D interior to R, 
we have 


lim fu(x,y) = f(x,y) 5 lim = af/ax ; lim (Afs/ay) = af/ay 


uniformly over D. 


* Weierstrass, Berliner Sitzungsberichte, 1885. 
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This result enables us to approximate to any biunivocal, differentiable 
transformation by an analytic transformation, the approximation applying 
to the derivatives, but the analytic transformation thus obtained is not 
necessarily one-to-one. For the case in which the differentiable transforma- 
tion has a non-vanishing jacobian, we may show that, when the approxima- 
tion is sufficiently close, it is biunivocal, and shall prove the following 
lemma: 


Lemma 3. If 
X = X(x,y), Y = V(x,y) 


is a continuous, biunivocal transformation, holding in the closed convex region 
s, for which X and Y possess first partial derivatives with respect to x and y, 
continuous in s, and for wizich the jacobian of X and Y with respect to x and 
v never vanishes in s, and 


X = X,(x,y), Y = Y.(x, 4) 


are a pair of differentiable functions, which approximate X and Y, and whose 
first partial derivatives approximate those of X and Y, uniformly in s, then, if 
the approximation is sufficiently close, these equations represent a biunivocal 
transformation for the interior of s. 


We note from the form of the approximating functions, that they take 
each point x, y into a point X., Y., but may take two points x, y into the 
same point X,, Y,. We must show this to be impossible when the approxima- 
tion is sufficiently good. We shall divide the proof into two parts, first proving 
the lemma for the case in which the given functions X, Y are linear functions, 
and then for the general case. 

1. Let 

X=ax+byt+e, Y=cx+dy+f 


be the given transformation. To predict the necessary degree of approxi- 
mation, we make some preliminary calculations. Consider the radius drawn 
out from any fixed point /; to a variable point ~, which moves out along 
this radius in s. If the line makes an angle A with the X axis, we shall have 
X — = a(x — x1) + — = r(acosA + bsin A), 
Y — = — m1) + d(y — y1) = r(ccosA + dsinA). 


For the distance in the transformed figure, S, we have 
R? = (X — Xi)?+ — Yi)? = r°[(a? + cos? A 
+ 2(ab + cd) cos A sin A + (6? + d?) sin 2A!. 


— 
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As A varies, the ratio R*/r? varies between the maximum and minimum 
values 


The product of these is (ad —bc)?, which is the square of the jacobian. This 
might have been predicted since the square root of the product of the maxi- 
mum and minimum values, and the jacobian both measure the ratio of the 
area of a circle in s to that of its corresponding ellipse in S. As the jacobian 
does not vanish, neither the maximum nor minimum value is zero. Let the 
value of the minimum be gq’. Then, for the affine transformation, the mini- 
mum value of R/ris q>0. 

Now consider the image of the variable radius f: p under the approxi- 
mating transformation. Let the transformed radius be P; P, where Pi= 
(X,, Y:) and P=(X, Y). We have for its length 


R = — — 

We calculate its derivative with respect to r, and find 

dR |cosA 

dr 

—Y1)(0¥/dy) ]sinA 
V(X —X1)?+(¥ 

We may write this in a form involving derivatives only, by noting that 
(X—X,) and (Y—Y,) enter, essentially, through their ratio, only. But, 
since the image of the straight line p, p is a differentiable curve, having 


P, P as a chord, there is some intermediate point on this curve for which 
the tangent is parallel to the chord. If this point is P we shall have 


Y—Y,  (d¥/dx) cos A + (@¥/dy) sin A 


X— dX (8 X/dx) cos A + (8 X/dy) sin A 


We notice that both terms of this ratio can not be zero unless the jacobian 
of the transformation is zero at P. We may now write for the derivative 


dR (X.cosA + XysinA)( X,zcosA + X,sinA) 
dr V 
(Y.cosA +Y,sinA)(¥2cosA +V,sinA) 


V (X,cosA + 


where the subscripts indicate partial derivatives. If in this expression we 
replace the partial derivatives X,, X,, Y., Y,, as well as Xz, Xy, Vz, Vy, 
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by a, b, c, d, their values for the affine transformation, it reduces to R/r for 
the affine transformation, or 


(a? + c?) cos? A + 2(ab + cd) cos A sin A + (b? + d*) sin? Af= > 0. 


As the expression is a continuous function of these partial derivatives for 
values in some neighborhood of a, b, c, d, a u4 may be found such that, if 
these partial derivatives are all within wu, of their values for the affine 
transformation, the expression will be >g/2, for a given A. But, as the func- 
tion is continuous in A, for values in some neighborhood of this given value, 
it will be>g/4. By the Heine-Borel theorem, a finite number of such neigh- 
borhoods may be found which include all values of A, and hence a number 
u may be found less than any of the corresponding u,. If the approximation 
has its partial derivatives within u of those of the affine transformation, for 
any A, we shall have dR/dr>q/4>0. 

We now assert that a transformation approximating the affine trans- 
formation to this degree of accuracy is necessarily one-to-one in s. For, if 
it were not, it would take two points in s, p: and 3, into the same point 
P2. Since s is convex, pz ps lies entirely in s. Let p; be a point on the extended 
line ps p2, and P; its transform. Let a variable point p, at distance r from 
pi, move along 2 ps, and let the distance of its transform, P, from P, be R. 
As r changes from /; 2 to pips, R changes from P; P2 back to PiP:. Hence, 
at some point between /, and #3, we must have dR/dr=0. But this is 
impossible, since dR/dr>q/4>0. 

This proves the lemma for the affine case. We notice that here the limit 
on the approximation involves only the derivatives. 

2. Now consider a general differentiable transformation, with non- 
vanishing jacobian in a closed region s. At every point / in this region, we 
form an approximating linear transformation, having the same derivatives 
as the given transformation has at this point. By the first part of this proof, 
we may find a u, for this linear transformation such that any transformation 
whose derivatives approximate its derivatives to within 1, will be one-to-one. 
Since the given transformation has continuous derivatives, we may surround 
each point by a circle of radius v, such that, in this circle, the partial deriva- 
tives never differ from those at the center by more than u,/2. Surround 
each point p by a circle of radius 2,/2._ By the Heine-Borel theorem, we may 
find a finite number of such circles such that each point of s is inside some 
one of them. Let u be the minimum 1, for any of these points, and v the 
minimum 2. We see that, if any point in s is taken as the center of a circle 
of radius 2/2, the partial derivatives of the given transformation will not 
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differ by more than u/2 from some linear transformation for which u may be 
taken as the w of part one. 

Since the given transformation is continuous in s, its inverse is, and from 
the nature of s we may find a V such that, if the images of two points are at 
distance less than V, the points themselves will be at distance less than 
v/4. We now assert that any approximating transformation which takes a 
point in s into a point at distance less than V from its transform under the 
given transformation, and whose derivatives are within u/2 of those of the 
given transformation, is necessarily one-to-one. For, suppose it took two 
points ~, and 2 into the same point P’. Let the transforms of these points 
under the given transformation be P; and Pz. From the nature of our 
approximation, P; P’ and P; P’ are both less than V. Hence P; P is less than 
2V, and f; pz is less than v/2. Draw a circle with center #; and radius 0/2. 
If this circle does not lie entirely in s, the part of it contained in s is convex, 
since s is convex, and hence this part may be used as the region of part one. 
But, in this circle, the derivatives of the given transformation do not differ 
by more than «/2 from those of some linear transformation for which this is 
the « of part one. But, since the derivatives of the approximating transfor- 
mation are within u/2 of those of the given transformation, they are within 
u of those of this linear transformation. Thus the approximating trans- 
formation is one-to-one in this circle, and can not take /; and #, into the 
same point. Thus our lemma is proved. 

We are now in a position to prove 


THEOREM I. Given a pair of continuous functions 
X = X(x,y), = Y(x,y), 
defining a continuous, biunivocal transformation of some closed, two-dimensional 


region r of finite connectivity of the x, y plane, into a closed region R of the X, Y 
plane, and a positive constant ¢, there exists a pair of analytic functions 


x X.(x,y), Y Y.(x,¥), 
defining a biunivocal transformation of some closed region of the x, y plane 


including r, into a closed region of the X, Y plane including R, whose distance 
from the given transformation is less than e. 


For, from the discussion of §2 we may embed the regions r and R in 
two squares s and S, and find a continuous transformation 7; of s on S, 
which agrees with the given transformation in r, and takes the vertices of 
s into those of S. Then we dissect the squares into convex polygons, by 
Lemma 1, such that any continuous biunivocal transformation 7; which maps 
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each polygon of s into the corresponding polygon of S is at distance less 
that €/2 from 7;. By Lemma 2, we then find a pair of functions 


i= X2(x,¥), Y= Y2(x,¥), 


with continuous first partial derivatives, whose jacobian never vanishes in 
s, and which has the properties demanded of 72. Consequently its distance 
from 7; is less than €/2. We next apply Lemma 3 to this transformation to 
find a V and a “/2 such that, if a pair of differentiable functions are given 
which approximate X and Y to within V/V/ 2, and whose derivatives 
approximate those of X and Y to within «/2, they define a one-to-one trans- 
formation ins. Also we find a W such that the transforms of any two points 
in S whose distance is less than W, under the inverse of 72, are at distance 
less than €/2. Finally, we use the method of Weierstrass recalled at the begin- 
ning of this section to find a pair of analytic functions 


X = X3(x,y), Y= Y3(x,¥), 


which approximate X2(x, y) and Y2(x, y) to within €/2/2, V/V2, W/V2, 
and whose derivatives approximate those of these functions to within /2. 
These equations define the analytic transformation whose existence the 
theorem asserts. For, by Lemma 3, it is one-to-one in s. From the choice of 
the approximation, and of W, we have 


[73(p) — T2(p)] <«/2, 
P) — Te(P)] = — < ¢/2. 


Thus the distance of the transformation 7; to 7; is less than ¢/2, and since 
that of JT, to 7; is less than €/2, the distance from 7; to 7; is less than e, 
in s. Since no point of R is within e of the boundary of S, 7; maps s ona 
region including R, and hence 7;~' is defined in a region including R. As the 
distance from 7; to 7; is less than e wherever both are defined, we see that 
T; satisfies all the conditions demanded of it in the theorem. 


CoROLLARY. The functions X3(x, y) and Y3(x, y) of the theorem may be 
taken as a pair of polynomials. 


For we may approximate to any analytic function, as well as to its 
first derivatives, by a polynomial. 


VI. TRANSFORMATIONS ON A SPHERE 


Our argument may be extended to transformations of a sphere into itself 
with but slight modification. A natural definition of an analytic transfor- 
mation on a sphere is a transformation which is expressed by aralytic functions 
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in terms of some siandard set of codrdinates, such as the polar coérdinates 
¢@ and @. To avoid trouble at the poles, we require that the functions be 
analytic and single-valued in ¢ and @ for every possible choice of the axis. 
In our discussion it will be more convenient to use an equivalent form of this 
definition, and note that any transformation of a three-dimensional region, 
not necessarily simply connected, including the points of the sphere as 
interior points, analytic in terms of cartesian coérdinates, which leaves 
the sphere invariant, determines a transformation on the sphere which is 
analytic in the above defined sense, and conversely any transformation 
analytic in the earlier sense may be extended to give an analytic space trans- 
formation of the kind here used. With this definition, we may formulate 


THEOREM II. Given a set of three continuous functions 
X = X(x,y,2), Y = Y(x,y,2), Z = 2Z(x,y,2), 
which leave the unit sphere invariant, 
X°+ 2+ =1, 


and define a continuous biunivocal transformation on this sphere, and a positive 
number ¢, there exists a set of three analytic functions 


X = X.(x,y,2), = Va(x,y,2),Z = Za(x,y,2), 
which leave the unit sphere invariant, 
X2+V24+2Z2 =1ifet+y+2=1, 


and define a biunivocal transformation of the unit sphere into itself whose 
distance from the given transformation is less than e. 


We divide the sphere by two networks of convex polygons, whose sides 
are great circles, the number of polygons meeting at each vertex being three, 
which have the property that any continuous biunivocal transformation T: 
of the sphere into itself, which maps each polygon of the first network on 
the corresponding polygon of the second network, is at distance less than 
e/2 from the given transformation, 7,. This is shown to be possible by the 
method used to prove Lemma I. To set up a differentiable transformation 
T2, we begin with the neighborhood of a vertex and its transform, gnomonic- 
ally project these neighborhoods on the tangent planes at the vertices in 
question, and set up affine transformations in these planes as in the proof 
of Lemma 2. They project back into differentiable transformations on the 
sphere, which will yield affine transformations when projected from the 
center on any tangent plane. To interpolate along a side, we work in the 
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tangent plane at its midpoint, and in that at the midpoint of the correspond- 
ing side, and form the cubic transformations of Lemma 2. These project 
into differentiable transformations on the sphere. For the interpolation 
into the interior of a polygon, we work in the tangent plane at some interior 
point of it, and that at some interior point of its transform, and proceed 
asin Lemma 2. We thus obtain 1 differentiable transformation of the sphere 
into itself which is one-to-one, possesses a non-vanishing jacobian with 
respect to any system of ¢, @ codrdinates, and approximating 7; to within €/2. 
We readily extend this to a region between two spheres concentric with the 
unit sphere, and enclosing it, by adding to the ¢, @ transformation the 
equation R=r. This transformation, when expressed in cartesian coérdinates, 
gives three differentiable functions 


X = X2(x,y,2), ¥ = Y2(x,y,2),Z = Z2(x,y,2), 


such that X?+YV?+2Z? =1 if x?+y?+2?=1. 

We shall now use the considerations of Lemma 3 to show that any 
differentiable transformation in three variables which approximates this 
one, both as to coérdinates and their derivatives, sufficiently closely, will, 
when the transforms of points on the unit sphere under it are projected 
gnomonically on the sphere, yield a one-to-one transformation of the sphere. 
At every point # of the sphere we draw the tangent plane, and project the 
points in some neighborhood of p on this tangent plane. Similarly we project 
their transforms on the tangent plane at the transformed point P. This gives 
a differentiable transformation of a part of one of these planes on the other. 
We form an approximating linear transformation, having the same deriva- 
tives as this transformation at p. By part 1 of Lemma 3, we may find a 
uy for this linear transformation such that any transformation of the plane 
whose derivatives approximate its derivatives to within u, will be one-to-one. 
We surround each point # by a circle of radius v, such that, in the circle 
which is the projection of this on the tangent plane, the partial derivatives 
of the plane transformation never differ from those at the center by more 
than u,/2. We then surround each point on the sphere by a circle of radius 
vp/2. We pick out a finite number of such circles with centers at p; such that 
each point on the sphere is inside some one of them. If, now, wis the minimum 
u, and v the minimum 2, for the corresponding points, if any point g on the 
sphere is taken as the center of a circle of radius 7/2, there is a point ?; 
on the sphere within a distance v/2 of g, such that if the points inside the 
circle at g are projected on the tangent plane to the sphere at #, and their 
transforms are projected on the tangent plane at P;, the transform of ?;, 
a plane differentiable transformation will arise with the property that any 
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other differentiable transformation whose partial derivatives differ from 
those for it by less than u/2 will be one-to-one. 

We now find a V for the transformation on the sphere such that, if 
the images of two points are at distance less than V the points themselves 
will be at distance less than 7/4. Now consider any approximating dif- 
ferentiable transformation, 7;. Project the points in a circle of radius v/2 
about g on the tangent plane at the nearest /;, and their transforms under 7; 
on the tangent plane at P;, the transform of p; under the given transforma- 
tion. A plane differentiable transformation will arise whose partial deriva- 
tives are seen to be continuous functions of the partial derivatives of 73, 
and certain quantities connected with the given transformation and its 
derivatives at p;. Consequently we may find a w’ such that, if the derivatives 
of T; are within u’ of those of the given transformation, the derivatives 
of the plane transformation just found will be within u/2 of the plane trans- 
formation previously found for p;. Thus the plane transformation will be 
one-to-one inside the circle of radius 7/2. If 7; in addition approximates 
the given transformation to within V in distance, it will project on the sphere 
into a transformation one-to-one throughout. For, if it took two points 
fp: and f2 into the same point P’, whose transforms under the given trans- 
formation were P; and P2, we should have P,P’ and P,P’ less than V, 
and hence P,P; less than 2V. Thus fif2 would be less than 7/2, and a circle 
with center at f; and radius v/2 could be drawn in which the projected 
transformation on the tangent plane at the nearest p; would be one-to-one, 
contradicting the assumption that /; and /2 gave a single point P’. 

We now find, by the Weierstrass method quoted in Section 5, applied 
to functions of three variables, a set of three analytic functions X;(x, y, 2), 
Y3(x, y, 2), Z3(x, y, 2) approximating the functions defining the differentiable 
transformation, X2(x, y, z), V2(x, y, 2), Z2(x, y, 2), to within €/4,/3, V/V/3, 
W//3, and whose derivatives approximate those of these functions to 
within «’, where W is chosen such that two points of the sphere whose dis- 
tance is less than W, under the inverse of the differentiable transformation, 
go into two points whose distance is less than €/4. We write, finally, 


X3(x,y,2) 
Y3(x,y,2) 

Z3(x,¥,2) 
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as the approximating analytic transformation. It clearly takes the sphere 
into itself, is one-to-one, and is at distance less than e from the given trans- 
formation 7), since it is at distance less than €/2 from 73. 


COROLLARY. Given a continuous, biunivocal transformation of any simply 
connected closed surface into itself, a positive number € and any set of curves 
on the surface deformable into meridians and parallels, we may find an analytic 
transformation, the analyticity being given in terms of the given curves, whose 
distance from the given transformation, measured in the same terms, is less thane. 


For we have merely to deform the closed surface into a sphere, and the 
curves into meridians and parallels, and apply Theorem II. 
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ERRATA, VOLUME 28 


M. H. IncrawAm, Solution of certain functional equations relative to a general 
linear set. 


Page 287, insert (1) at left of second displayed formula; 


Page 289, formula (i) should read v.).0 =1 
last line above footnote should read 


= II hic (A1,h) ; 


Page 292, in right member of second displayed equation above the 
summation sign, for + h” read h.” 


Page 293, in the first line of the proof of Theorem 2, after “distinct” 
insert ‘determinations of coefficients for’’; 
fourth line from bottom should read 
hy ‘ 


A 
third line from bottom should read 
w (hi )=ki (p—1) 


ky Ay 


last line, above second summation sign for “w(h,)=(p—1)” 
read “w(h,)=k(p—1).” 
T. H. GRONWALL, On the zeros of the function B(z) associated with the gamma 
function. 


Page 395, line next to the bottom Gosteete) for “[f(z) —g(z)]g(z)” read 
“Lf(z) 


Page 397, line 13, for ““1+O((log m)/m)®” read 
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